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Preface 



The material treated in this book was brought together for a PhD course I taught at 
the University of Pisa in the spring of 1999. It is intended to be an introduction to 
small divisors problems. The book is divided in two parts. In the first one I discuss 
in some detail the theory of linearization of germs of analytic diffeomorphisms of 
one complex variable. This is a part of the theory where many complete results 
are known. The second part is more informal. It deals with Nash-Moser's implicit 
function theorem in Frechet spaces and Kolmogorov-Arnol'd-Moser theory. Many 
results (and even some statements) are just briefly sketched but I always refer the 
reader to a choice of the huge original literature on the subject. 

I am particularly fond of the topics described in the first part, especially 
because of their interplay with complex analysis and number theory. The second 
part is also fascinating both because of its generality and because it leads to 
applications to Hamiltonian systems. Both are the object of major active research. 

These lectures contain many problems (some of which may challenge the 
reader) : they should be considered as an essential part of the text. The proof of 
many useful and important facts is left as an exercise. 

I hope that the reader will find these notes a useful introduction to the subject. 
However the reason of the long list of references at the end of these notes is my 
belief that the best way to learn a subject is to study directly the papers of those 
who invented it : Poincare, Siegel, Kolmogorov, Arnol'd, Moser, Herman, Yoccoz, 
etc. 

I am very grateful to Mariano Giaquinta for his invitation to give this series 
of lectures. I also wish to thank Carlo Carminati, whose enthusiasm is also at the 
origin of this project, and whose remarks have been essential in correcting some 
mistakes. 

Udine, December 8, 1999. 

Stefano Marmi 
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Part I. One-Dimensional Small Divisors. Yoccoz's The- 
orems 



1. Germs of Analytic Diffeomorphisms. Linearization 

A dynamical system is the action of a group (or a semigroup) on some space. In 
looking for the simplest cases we are led to ask for the lowest possible dimension 
of the ambient space together with the highest possible regularity of the action. 
A remarkably rich but elementary situation is obtained considering the group of 
germs of holomorphic local diffeomorphisms of C which leave the point z = fixed. 
In what follows we will omit the symbol o for the composition of two germs (unless 
some confusion may be possible). 

Let C[[z}] denote the ring of formal power series and C{z} denote the ring of 
convergent power series. 

Let G denote the group of germs of holomorphic diffeomorphisms of (C, 0) 
and let G denote the group of formal germs of holomorphic diffeomorphisms of 
(C, 0) : G = {/ G zC{z} , f(O) # 0}, G = {/ G zC[[z]] , /i # 0}. One has the 
trivial fibrations 



G — U\ e c*G\ 



C* 



G — U\ e c*G\ 



C* 



(1.1) 



where 



Gx = {f(z) = J2 fm n e C[[z]] , A = A} , 

Gx = {f(z) = J2 fnz n G C{Z} , /l = A} . 



(1.2) 
(1.3) 



n = l 



1.1 Conjugation, Symmetries 

Let Ad g / denote the adjoint action of g on / : Ad 3 / = g~ x fg. 

Definition 1.1 Let f G G (resp. f G G). We say that a germ g (resp. a formal 
germ g G G) is equivalent or conjugate to f (resp. f) if it belongs to the orbit of 
f (resp. f) under the adjoint action ofG\ (resp. G±) : 

fr ^ g ^ 3heGi: g = h~ 1 fh, 

f ~ g ^ 3h E G ± : g = hT 1 fh . 

The set of germs equivalent to / obviously forms an equivalence class, the orbit of 
/ under the adjoint action of G\ : 

[/] = Ad Gl / = {geG,3heG 1 :g = Ad h f = h' 1 fh} . 

The same holds in the formal case. 

Definition 1.2 A germ g G G is a symmetry of f G G if g G Cent(f), i.e. if 
Adgf = f. We will denote by Cent (/) the formal analogue of Cent (f). 

Exercise 1.3 Let / G G\ (resp. / G G\) and assume g ~ / (resp. g ~ /), i.e. 
/ = h~ x gh for some h G G\. Then show that 

(1) g G G\ (resp. g G G\) thus /i = f'(0) = A is invariant under conjugation. 

(2) Cent (/) is conjugated to Cent (g), i.e. Cent (/) = /i _1 Cent (g)h ; 

(3) / z = {r, nGZ}cCent(/). 

1.2 Linearization 

Let Rx denote the germ R\(z) = Xz. This is the simplest element of G\. It is easy 
to check that, if A is not a root of unity, its centralizer is Cent (R\) = {-R M , /U G 

C*}. 

Exercise 1.4 Let / G G\ and assume that A is not a root of unity. The morphism 

li : Cent (/) -► C* 

g^ Kg) '=9i =#'(o) 



is injective. [Hint : this is equivalent to showing that g G Gi, g G Cent (/) =>■ g — 
id . On the other hand if g G G M and g G Cent (/) one can recursively determine 
the power series coefficients of g : one has 

n— 1 n— 1 

(A n -A)# n = (fi n -fi)f n + ^2fj Y^ 9rn • • • g nj -^2 gj Yl fni--'fnj, 

j=2 n\-\-...nj—n i=2 ni + ...n,=n 

for all n > 2.] 

Definition 1.5 A germ / G Ga is linearizable if there exists hf G G\ (a 
linearization of f) such that hi fhf = R\, i.e. f is conjugate to (its linear part) 
R\. f is formally linearizable if there exists hf G G\ such that hi fhf = R\ (note 
that in this case this is a functional equation in the ring C[[z]] of formal power 
series). 

From Exercise 1.4 it follows that when A is not a root of unity the linearization (if 
it exists) is unique : if h\ and hi are two linearizations of the same / G G\ then 
hih^ 1 G ker \x. 

Our first result on the existence of linearizations will concern the case when 
A is a root of unity. 

Proposition 1.6 Assume A is a primitive root of unity of order q. A germ f G G\ 
is linearizable if and only if f q = id . The same holds for a formal germ f G G\. 

Proof. Assume that / is linearizable. Then z = X q z = (hi o f o hf) q {z) = 
(hi 1 o f q o hf)(z) from which one gets f q (z) = (hf o id o hl l )(z) = z. 

Conversely if f q = id then defining hi := - X11=o A _J / J one immediately 
checks that hi 1 G G\ if / G G\ (resp. hi 1 G G\_ if / G G\) and hi 1 o f o hf = R\. 

D 



1.3 Formal Conjugacy Classes 

In the formal case, all conjugacy classes of germs whose linear part is a root of 
unity are well known : 



Proposition 1.7 Let X be a primitive root of unity of order q. Let f G G\ and 

assume that f q 7^ id . Then there exists a unique integer n > 1 and two complex 
numbers a, b G C, a 7^ 0, such thai / is formally conjugated to 

Pn,a,b,x{z) = \z{l + az nq + a 2 bz 2nq ) . 



Exercise 1.8 Prove Proposition 1.7. Note that if one allows to conjugate also with 
homoteties then / is formally conjugated to P n , c ,\(z) = Xz(l + z nq +cz 2nq ). [Hint : 
the idea of the proof is to iterate conjugations by polynomials <Pj(z) = z + fljzi 
with j > 2 and suitably chosen (3j. See also [Ar3], [Be].] 

But in the formal case everything is very simple : 

Proposition 1.9 Assume that A is not a root of unity. Then G\ is a conjugacy 
class and G\ acts freely and transitively on G\. 

Proof. To see that any / G G\ is conjugate to R\ we look for hf G G\ such that 
fhf = hfR\. We develop and solve this functional equation by recurrence : we 
get, for n > 2 (denoting hf(z) = Y^Li h-nZ 71 , hi = 1) 



n 



hn =xTTx^ fj 2^ h ni ---h nj . (1.4) 

j=2 ni + ...+rij=n 

The action of G\ on G\ is free. This follows from the fact that the only germ 
tangent to the identity belonging to the centralizer of / is the identity (see Exercise 
1.4). Transitivity of the action is trivial : given two formal germs /1 and fa both in 
G\ there exist two formal linearizations hi and hi and clearly fa = Ad^ ^-^{fa)- 

U 

Collecting propositions 1.6, 1.7 and 1.9 together we have a complete classification 
of the conjugacy classes of G : 

(I) if A is not a root of unity then G\ is a conjugacy class ; 
(II) if A = e 2lTip / q , q > 1, {p, q) — 1 then the conjugacy classes in G\ are [R\] and 

{[Pn,a,b,\]}aeC* ,b€<C,n>l- 



1.4 Koenigs-Poincare Theorem 

In the holomorphic case the problem of a complete classification of the conjugacy 
classes is still open and, as Yoccoz showed, perhaps unreasonable. The first 
important result in the holomorphic case is the Koenigs-Poincare Theorem : 

Theorem 1.10 (Koenigs-Poincare) If |A| 7^ 1 then G\ is a conjugacy class, 
i.e. all f G G\ are linearizable. 

Proof. Since / is holomorphic around z = there exists c\ > 1 and r G (0, 1) 
such that \fj\ < c\r l ~i for all j > 2. Since |A| 7^ 1 there exists c 2 > 1 such that 
|A n - A| _1 < c 2 for all n > 2. 

Let (cr n ) n >i be the following recursively defined sequence : 

n 

0"1 = 1 , On = ^2 ^2 a n 1 ---On i - (1-5) 

j'=2 ni + ...+rij=n 

The generating function a(z) = Y^=i a nZ n satisfies the functional equation 

/ s o-(z) 2 



thus a(z) = 1+z ~ vi-§z+z j g ana iytic in the disk \z\ < 3 — 2\/2 and bounded and 
continuous on its closure. By Cauchy's estimate one has a n < 03(3 — 2y / 2) 1_n for 
some C3 > 0. 

Since A is not a root of unity, / is formally linearizable and the power series 
coefficients of its formal linearization hf satisfy (1.4). By induction one can check 
that \h n \ < (ciC2r _1 ) n_1 cr n , thus hf G C{z}. D 

Remark 1.11 Since the bound |A n — A| _1 < c 2 is uniform w.r.t A G D(Xq, 6), where 
Ao G C* \ S 1 and 5 < dist (Ao, S 1 ), the above given proof of the Poincare-Koenigs 
Theorem shows that the map 

C*\S X -^d 
A i—*' h?(X) 

is analytic 1 for all / G z 2 C{z}, where h?(X) is the linearization of Xz + f(z). 

The Poincare-Koenigs Theorem has the following straightforward generalization : 



1 This notion needs a little comment since C{z} is a rather wild space : it is 



Theorem 1.12 (Koenigs— Poincare with parameters) Let r > 0, let 

f JrXDrCPxC^CjfijZJH f{t, z) = f t (z) be an holomorphic map such 
that f (z) = X(0)z + 0(z 2 ), with |A(0)| g {0,1}. Then there exists r G (0,r), 
a unique holomorphic function z$ : O™ — ► C and a unique h : D" x D rQ — >■ C, 
(t, z) i— > /it(-z) = h(t, z) holomorphic such that for t G D" o one has the following 
properties : 

(i) f t (z (t)) = zo(t), ft(z) = \(t)(z - zo(t)) + 0((z- z (t)f), \X(t)\ ? {0, 1} ; 

(ii) ht(0) = z (t), h' t (0) = &ht\ z =o = 1 ; 
(in) h^ 1 ofoht = R\( t )- 

Proof, (sketch) The existence of zq and (i) follows easily from the implicit function 
theorem applied to F(t,z) = f(t,z) — z at the point (t,z) = (0,0) (note that 
F(0,0) = and j%ft(z)\(t,z)=(o,o) = A(0) - 1 # 0). Therefore there exists a 
unique fixed point for f t close to z = when t is close to depending analytically 
on t as t varies in a neighborhood of (t, z) = (0,0). Then one can consider 
gt(z) = ft(z + zo(t)) — zo(t) and apply the proof given above of the Koenigs- 
Poincare Theorem to gt(z). It is easy to convince oneself that the linearizing map 
depends analytically on t. □ 



1.5 Centralizers and Linearizations 

The study of centralizers generalizes the study of linearizability as the following 
exercises show : 

Exercise 1.13 Prove that if / G G\ is linearizable and A is not a root of unity 
then Cent (/) ~ C*. [Hint : use the fact that the centralizer of / is conjugate to 
the centralizer of R\ which is completely known.] 

Exercise 1.14 Prove that if g G Cent (/), g G G^ is linearizable and \x is not a root 



an inductive limit of Banach spaces, thus it is a locally convex topological vector 
space and it is complete but it is not metrisable, thus it is not a Frechet space 
(see Section 9.1). Here we simply mean that if A varies in some relatively compact 
open connected subset of C* \ S 1 then h?(X) belongs to some fixed Banach space 
of holomorphic functions (e.g. the Hardy space H°°(3 r ) of bounded analytic 
functions on the disk D r = {z G C, \z\ < r}, where r > is fixed and small 
enough) and depends analytically on A in the usual sense. 

8 



of unity then / is linearizable. [Hint : use that / G Cent (g) = {h g R v h g 1 , z/ G C*} 
and that v is invariant under conjugacy.] 

Exercise 1.15 Prove that if / G G\ and A is not a root of unity then / is 
linearizable if and only if Cent (/) ~ C*. [Hint : apply exercises 1.13, 1.4 and the 
Koenigs-Poincare Theorem] 

1.6 Cremer's Non-Linearizable Germs 

When |A| = 1 and A is not a root of unity we can write 

A = e 2lTia with ft GR\Qn (-1/2, 1/2) , (1.7) 

and whether / G G\ is linearizable or not depends crucially on the arithmetical 
properties of a. Let {x} denote the fractional part of a real number x : 
{x} — x — [x], where [x] is the integer part of x. 

Theorem 1.16 (Cremer) If limsup n ^ +00 |{na}| _1 / n = +oo then there exists 
f G G e 2TTia which is not linearizable. 

Proof. First of all note that lim sup n ^ +00 |{na}| _1 / n = +oo if and only if 

limsup|A n -ir 1/n = +oo 

n — >+oo 

since 

|A n - 1| = 2| sin(7raa)| G (2|{na}|, 7r|{na}|) . 

Then we construct / in the following manner : for n > 2 we take \f n \ = 1 and we 
choose inductively arg f n such that 

n-l 

arg f n = arg ^ fi ^2 h ni ■ • ■ h nj , (1.8) 

j=2 ni + ...+rij=n 

(recall the induction formula (1.4) for the coefficients of the formal linearization of 
/ and note that the r.h.s. of (1.8) is a polynomial in n — 2 variables ^ • • • > fn-i 
with coefficients in the field C(A)). Thus 

l/n| 1 



\hJ > 



\X n -l\ IA n -l| 



and limsup n ^_|_ 00 l/inl 1 '" = +oo : the formal linearization h is a divergent series. 

□ 

9 



Exercise 1.17 Write the decimal expansion of an irrational number a satisfying 
the assumption of Cremer's Theorem. 

Exercise 1.18 Show that the set of irrational numbers satisfying the assumption 
of Cremer's Theorem is a dense G§ with zero Lebesgue measure (following Baire, 
a set is a dense G$ if it is a countable intersection of dense open sets. These sets 
are "big" from the point of view of topology) . 

In the next Chapter we will continue our study of the problem of the existence 
of a linearization of germs of holomorphic diffeomorphisms. To this purpose the 
following "normalization" will be useful. 

1.7 Normalized Germs 

Let us note that there is an obvious action of C* on G by homotheties : 

(fi, /)£C*xGh Ad R J = R~ l fR» . (1.9) 

Note that this action leaves the fibers G\ invariant by Exercise 1.3. Also, / G G\ 
is linearizable if and only if Ad^ / is also linearizable for all \x G C* (indeed if 
hf linearizes / then AdR hf linearizes Ad_R /). Therefore, in order to study the 
problem of the existence of a linearization, it is enough to consider G/C*, i.e. 
we identify two germs of holomorphic diffeomorphisms which are conjugate by a 
homothety. 

Consider the space S of univalent maps F : D — *■ C such that F(0) = and 
the projection 

G^ S 

f if / is univalent in D 



f , , , _ 

' AdR r f if / is univalent in 

This map is clearly onto and two germs have the same image only if they coincide or 
if they are conjugate by some homothety. Thus this projection induces a bijection 
from G/C* onto S. 

In what follows we will always consider the topological space S of germs of 
holomorphic diffeomorphisms / : D — *■ C such that /(0) = and / is univalent in 
D. We will denote 

• S\ the subspace of / such that /'(O) = A ; 

• Sj the subspace of / such that |/'(0)| = 1. 

Clearly the projection above induces a bijection between G\/C* and S\. 

10 



2. Topological Stability vs. Analytic Linearizability 

The purpose of this Chapter is to connect the study of the conjugacy classes of 
germs of holomorphic diffeomorphisms to the theory of one-dimensional conformal 
dynamical systems and in particular to the notion of stability of a fixed point. The 
extremely remarkable fact is that stability, which is a topological property, will 
turn out to be equivalent to linearizability, which is an analytic property. 



2.1 Dynamics of Rational Maps 

Let us first of all recall the notion of normal family on an open subset U of the 
Riemann sphere C = CU{oo}. To this purpose we recall the usual system of 
coordinates on C determined by the stereographic projection : z : C \ {00} — ► C, 
z(0) = and w : C\ {0} — > C, w(oo) = 0, related by zw = 1. The spherical metric 
on C is defined as follows : 

{ -iii 12 in the z-chart ; 
2\di\ (2- 1 ) 

i+l^ 2 in the u;-chart ; 

Let U C C be open and Tjj — {/ : U — > C , / meromorphic}. We endow C with 
the spherical metric and T\j with the topology of uniform convergence on compact 
subsets of U. It is a classical result of Weierstrass that the limit of a convergent 
sequence in Tjj still belongs to Tjj (note that the constant function / = 00 is 
considered meromorphic) . 

Definition 2.1 A family JF c Tu is normal if it is relatively compact in Tjj, i.e. 
amy sequence {f n } C T contains a subsequence which converges uniformly in the 
spherical metric on compact subsets of U. 

Warning ! If {f n } is a normal family then {f^} needs not be normal : e.g. 
f n (z) = n(z 2 — n) on C. 

By means of the Ascoli-Arzela theorem one gets : 

Proposition 2.2 

(I) A family of meromorphic functions on U is normal on U if and only if it is 
equicontinuous on every compact subset of U ; 

11 



(II) A family of analytic functions on U is normal on U if and only if it is locally 
uniformly bounded (i.e. uniformly bounded on every compact subset ofU). 

Proof. The first statement is obvious since the compactness of C guarantees that 
the family is uniformly bounded. The second statement follows from Cauchy's 
integral theorem. □ 

The notion of normal family allows us to introduce the basic notions of one- 
dimensional holomorphic dynamics. Here we are interested in studying the 
dynamics of a discrete dynamical system (i.e. an action of N) on the Riemann 
sphere C generated by a holomorphic transformation R : C — ► C, i.e. and element 
of End (C). 

Let d denote the topological degree of R. We will assume d > 2 thus R is a 
d-fold branched covering of the Riemann sphere and can be written in a unique 
way in the form R(z) = tjtjt, where P(z) G C[z], Q(z) G C[z] have no common 
factors and d = max(degP, degQ). In fact every d : 1 conformal branched covering 
of C comes from some such rational function and 

End (C) = {R : C -> C holomorphic} = {R : C -> C , R{z) = P(z)/Q(z)} . 

Note that 

R(z) = < oo if Q(z) =0, 

[ lim^oo gg if z = oo. 

We define the iterates R n of R as usual : R n = RoR 71-1 . Note that R n has degree 
d n . 

Given a point zq G C the sequence of points {z n } n >o defined by z n+ i = R(z n ) 
is called the orbit of zq. A point zq is a fixed point of R if R(zq) = zq, periodic 
if z n = R n (zo) = zq for some n (the minimal n is the period). The orbit 
{zi,...,z n = zo} is called a cycle. The point zq is called preperiodic if z\~ is 
periodic for some k > 0. 

The fundamental dichotomy of C associated to the dynamics of R is the 
following : 

Definition 2.3 The Fatou set F(R) of R is the set of points zq G C such that 
{R n } n >o is a normal family in some disk D(zo,r) (w.r.t. the spherical metric). 
The complement of the Fatou set is the Julia set J(R). 

12 



Exercise 2.4 Show that J(R) = J(R n ) for all n > 1 and that J(R) is nonempty 
and closed. [Hint : if J(R) = then {-R n } n >o is a normal family on all C thus 
R n i — ► S G End (C). Compare degrees.] 

2.2 Stability 

Definition 2.5 A point zq G C is stable if for all 5 > there exists a neighborhood 
W of z such that for all z E W and for all n > one has d(R n (z), R n (z )) < S 
(here, as usual, d denotes the spherical metric). 

Exercise 2.6 Show that a point is stable if and only if it belongs to the Fatou set. 

Exercise 2.7 Let R G End (C) and assume that -R(O) =0. If R is linearizable and 
-R'(O) = A, |A| < 1 then is a stable fixed point. 

If we consider the more general situation of a germ / G S, i.e. / : D — *■ C 
injectively and holomorphically, /(0) = 0, the definition of stability must be 
slightly generalized so as to take into account the fact that the iterates of / are 
not necesaarily defined for all n. 

Definition 2.8 is stable if and only if there exists a neighborhood UofO such 
that f n is defined on U for all n > and for all z G U and n > one has 
\f n (z)\<l. 

Exercise 2.9 Show that if / is a rational map with a fixed point then definitions 
2.8 and 2.5 are equivalent. 

Exercise 2.10 If /'(0) = A and |A| > 1 then is not stable. 

To each germ / G S, |/'(0)| < 1, one can associate a natural /-invariant compact 

set 

G K f := f| f~ n (B) . (2.2) 

n>0 

Let Uf denote the connected component of the interior of Kf which contains 0. 
Then is stable if and only if Uf ^ 0, i.e. if and only if belongs to the interior 
otKf. 

Exercise 2.11 Show that if / G S and |/'(0)| < 1 then is stable. [Hint : consider 
a small disk around on which the inequality |/(z)| < p\z\ with p < 1 holds.] 

13 



2.3 Stability vs. Linearizability 

The main result of this section is the equivalence (for |/'(0)| < 1) of stability (a 
topological notion) and linearizability (an analytical notion) : 

Theorem 2.12 Let f G S, |/'(0)| < 1. is stable if and only if f is linearizable. 

Proof. The statement is non-trivial only if A = /'(0) has unit modulus. If / is 
linearizable then the linearization hf maps a small disk D r around zero conformally 
into D. Since hf(0) = and \f n (z)\ < 1 for all z G /i/(O r ) one sees that is stable. 
Conversely assume now that is stable. Then Uf ^ and one can easily see 
that it must also be simply connected (otherwise, if it had a hole V, surrounding 
it with some closed curve 7 contained in Uf since |/ n (z)| < 1 for all z G 7 and 
n > the maximum principle leads to the same conclusion for all the points in V 
thus V C Uf). Applying the Riemann mapping theorem to Uf one sees that by 
conjugation with the Riemann map / induces a univalent map g of the disk into 
itself with the same linear part A. By Schwarz' Lemma one must have g(z) = Xz 
thus / is analytically linearizable. □ 

When A = f'(0) has modulus one, is not a root of unity and is stable then 
Uf is conformally equivalent to a disk and is called the Siegel disk of / (at 0). 
Thus the Siegel disk of / is the maximal connected open set containing on 
which / is conjugated to R\. The conformal representation hf : D c (/) —* Uf of 
Uf which satisfies hf(0) = 0, h'f(0) = 1 linearizes / thus the power series of hf 
and hf coincide. If r(f) denotes the radius of convergence of the linearization hf 
(whose power series coefficients are recursively determined as in (1.4)), recalling 
the definition of conformal capacity (Exercise Al.4, Appendix 1) we see that : 

(i) if r(f) > then < c(U f , 0) = c(f) < r(f) ; 

(ii) if r(f) = then c(f) = r(f) = 0. 

Exercise 2.13 Show that the map c : Sj — ► [0, 1] which associates to each germ 
/ the conformal capacity of Uf w.r.t. is upper semicontinuous. 

When is not stable and A is not a root of unity Kf is called a hedgehog. 

We conclude this introduction to Siegel disks with two results on their 
conformal capacity. 
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Proposition 2.14 One has c(f) = r(f) when at least one of the two following 
conditions is satisfied : 

(i) Uf is relatively compact in D ; 

(ii) each point ofS 1 is a singularity of f. 

Proposition 2.15 Let A e T and assume that A is not a root of unity. Then 



inf c(f) = inf r(f) 



For the proofs see [Yo2, p. 19]. 
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3. The Quadratic Polynomial. Yoccoz's Proof of Siegel 
Theorem 

In this Chapter we will show the special role played by the quadratic polynomial 

Px(z) = \ (z ~ y) ■ (3.1) 

Indeed P\ is the "worst possible perturbation of the linear part R\" as the following 
theorem shows 

Theorem 3.1 (Yoccoz) Let A = e 2 ™, a G R \ Q. If P\ is linearizable then 
every germ f G G\ is also linearizable. 

Proof. Let / G G\, f(z) = Xz + *Y^ = <ifnZ n . By conjugating with some 
homothety one has |/ n ] < 10 _3 4 _n . We now consider the one-parameter family 
fb(z) = Xz+bz 2 +J2^=2 fnZ n - Note that /o = /. Since A is not a root of unity there 
exists a unique formal germ hb G G\ such that h b ~ 1 fbhb = R\- Its power series 
expansion is hb{z) = z + J2^ = 2 h n (b)z n with h n (b) G C[b]. Thus by the maximum 
principle one has |/i n (0)| < max |6|=i/2 \h n (b)\- If \b\ = 1/2 then (possibly after 
conjugation with a rotation) fb(z) = P\(z) + J2^=2 f nZn = ^( z ) + VK- 2 ) an d 
it is immediate to check that sup z6B3 \ip(z)\ < 10~ 2 . From Douady-Hubbard's 
theorem on the stability of the quadratic polynomial (Appendix 1) it follows that 
fb is quasiconformally conjugated to P\. If P\ is linearizable then is stable for 
P\ thus also for fb since a quasiconformal conjugacy is in particular a topological 
conjugacy. But we know that this implies that fb is linearizable. Therefore there 
exists two positive constants C and r such that \h n (b) \ < Cr~ n for all b of modulus 
1/2, thus | /i n (0)| < Cr~ n . Then ho converges and /o = / is linearizable. □ 



3.1 Yoccoz's Linearization Theorem for the Quadratic Polynomial 

Once one has established that the linearizability of the quadratic polynomial for 
a certain A implies that G\ is a conjugacy class the following remarkable theorem 
of Yoccoz shows that G\ is a conjugacy class for almost all A G T. 

Theorem 3.2 Let X = e 2lvioL , a G R \ Q. For almost all X G T the quadratic 
polynomial P\ is linearizable. 
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This statement deserves a comment. As we will see in Chapter 5 this theorem of 
Yoccoz is indeed weaker than the Siegel [S] and Brjuno [Br] theorems which date 
respectively to 1942 and 1970. What is very remarkable is the proof of Theorem 
3.2 which does not need any subtle estimate on the growth of the coefficients 
of the formal linearization as provided by (1.4) : compare with the proof of the 
Siegel-Brjuno Theorem given in Section 5.1. 

Let us note that P\ has a unique critical point c = 1 (apart from z = oo) and 
that the corresponding critical value is v\ = P\{c) = A/2. If |A| < 1 by Koenigs- 
Poincare theorem we know that there exists a unique analytic linearization H\ of 
P\ and that it depends analytically on A as A varies in D. Let r 2 (A) denote the 
radius of convergence of H \ . One has the following 

Proposition 3.3 Let A G D. Then : 

(1) r 2 (A)>0; 



(2) r 2 (A) < +oo and H\ has a continuous extension to O r2 ( A ). Moreover the map 
H\ : O r2 ( A ) — ► C is conformal and verifies P\ o H\ = H\ o R x . 

(3) On its circle of convergence {z , \z\ — r 2 (A)}, H\ has a unique singular point 
which will be denoted u(X). 

(4) H\(u(\)) = 1 and (H\(z) — l) 2 is holomorphic in z = u(X). 

Proof. The first assertion is just a consequence of Koenigs-Poincare theorem. 

The functional equation P\(H\(z)) = H\(Xz) is satisfied for all z G O r2 ( A ). 
Moreover H\ : D r2 (A) —* C is univalent (if one had H\(z\) = H\{z2) with z\ ^ Z2 
and z\,Z2 G O r2 (A) one would have H\(\ n zi) = H\(\ n Z2) for all n > which is 
impossible since |A| < 1 and H' x (0) = 1). Thus r 2 (A) < +oo. On the other hand 
if H\ is holomorphic in D r for some r > and the critical value v\ ^ H\(B r ) 
the functional equation allows to continue analytically H\ to the disk D|^|-i r . 
Therefore there exists u(X) G C such that \u(X)\ = r 2 (A) and H\(Xu(X)) = v\. 
Such a u(X) is unique since H\ is injective on O r2 ( A ). If \w\ = |A|r 2 (A) and 
w 7^ Xu(X) one has H\(w) = P\(H\(X~ 1 w)) and 

^(A"^) = 1 - v /l-2X- 1 H x (w) (3.2) 



which shows how to extend continuously and injectively H\ to D r2 ( A ). By 
construction the functional equation is trivially verified. This completes the proof 

of (2). 
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To prove (3) and (4) note that from H x (Xu(X)) = P x (H x (u(X))) it follows 
that H\(u(\)) = 1. Formula (3.2) shows that all points z G C, \z\ = ^(A) are 
regular except for z — u(X). Finally one has (H x (z) — l) 2 = 1 — 2X~ 1 H x (Xz) which 
is holomorphic also at z — u(X). D 

The fact that H x is injective on O r2 (A) implies that r2(A) < +oo (otherwise it 
would be a biholomorphism of C thus an affine map). A more precise upper 
bound is provided by the following 

Lemma 3.4 (a priori estimate of r2(A)). ^(A) < 2. 

Proof. It is an easy consequence of Koebe 1/4-Theorem. Indeed if / G S\ and 
t > then / = R^fRt : B t -i -»■ C is univalent and /(B t -i) = R t -if(B). By 
Koebe 1/4-Theorem one has B 1/4 C /(D) thus D t -i/ 4 C /(B t -i). But we know 
that v x £ H x (B WraW ) thus K| = l^i > J^^. □ 

Exercise 3.5 Show that r2(A) < 8/7. [Hint : apply (Al.l) to the function 

H x (z)(l + ^^H x (z)\ eSi, 

where H x {z) = H ^ Z K ] 

Exercise 3.6 Show that the image by H x of its circle of convergence is a Jordan 
curve, analytic except at H x (u(X)) = 1 where it has a right angle. 

Proposition 3.7 u : B* — > C has a bounded analytic extension to B. Moreover 
it is the limit of the sequence of polynomials u n (X) = X~ n P^(l) uniformly on 
compact subsets of B. One has u(0) = 1/2. 



Proof. By Proposition 3.3 one has P x (l) = H x (X n u(X)). From Lemma 3.4 and 

u(X) 



Koebe's distorsion estimates (specifically (Al.4) ) applied to H x (z) — ■■ x -"- 



one has 



\\\n |\| 

\PZ(1)\ = \u(X)H x (X n )\ < r 2 (A) - ! L < 2 - '' 



(1-|A|-) 2 " (1-|A|-) 2 ' 

thus |it n (A)| < 2(1 — |A|) -2 for all A G B and the polynomials u n verify the 

recurrence relation 

A n 
u (A) = 1 , u„+i(A) = w„(A) - — («n(A)) 2 . (3.3) 
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This shows that u n converges uniformly on compact subsets of D. The limit is u 
since 

lim « n (A) = lim \- n H x (\ n u(\)) = u(X) . 

Finally from |u(A)| = ^(A) and Lemma 3.4 one has |w(A)| < 2 on D. D 

The function u : D — ► C will be called Yoccoz's function. It has many remarkable 
properties and it is the object of various conjectures (see Section 5.3). 

Exercise 3.8 Check that : 
1) u(A)-u n (A) = 0(A»); 

o\ (\\ _ 1 A A 2 A 3 9A 4 A 5 7A 6 , 3A 7 29A 8 A 9 , 25A 10 , 559A 11 , 

L) a\/\) 2 8 8 16 128 128 128^256 1024 256 ' 2048 ' 32768 '■■■■> 

3) u(X) G Q{A} and all the denominators are a power of 2. 
Write a computer program to calculate the power series expansion of u and use 
it to design the level sets of log |tt| and arg-u. Try to compute the graph of 
6 i— > arg-u(re 27ri61 ) as r — > 1—. You may use some formulas given in [Yo2, pp. 
70-71] and to compare with [MMY2] . If you get nice pictures I would like to get 
a copy of them. 



3.2 Radial Limits of Yoccoz's Function. Conclusion of the Proof 

Proposition 3.9 Let Ao G T and assume that Xq is not a root of unity. Then 
r 2 (A ) > limsup D 3 A ^ A() |u(A)|. 

Proof. Let r = limsup ]D) 3 A ^ Ao |w(A)|. It is not restrictive to assume r > 0. Let 
(A n ) n >i C D such that X n — > Ao and |w(A n )| — >■ r. Since the linearizations H\ n 
are univalent on their disks of convergence D r2 ( Ari ) one can extract a subsequence 
uniformly convergent on the compact susbets of D r . The limit function H verifies 
H(0) = 0, if' (0) = 1 and H(X z) = P Xo (H(z)) (this is immediate by taking the 
limit of the corresponding equations for A n ). Thus H\ = H and r2(Ao) > r. D 

Yoccoz has indeed proved the following stronger result [Yo2, pp. 65-69] 

Theorem 3.10 For all Xq G T, |w(A)| has a non-tangential limit in Ao which is 
equal to the radius of convergence r2(Ao) of H\ . 

Of course, if Ao is a root of unity then P Ao is not even formally linearizable and 
one poses r2(Ao) = 0. 
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Collecting Propositions 3.3, 3.7 and 3.9 together one can finally prove Theo- 
rem 3.2. 

Proof, of Theorem 3.2. Applying Fatou's Theorem to u : D — *■ C one finds that 
there exists u* G L°°(T, C) such that for almost all Ao 6 T one has |it*(Ao)| > 
and u(X) — ► u(Ao) as A — > Ao non tangentially. From Proposition 3.7 one concludes 
that for almost all Ao G T one has r2(Ao) > 0. □ 

Remark 3.11 Continuing the above argument of Yoccoz, L. Carleson and P. 
Jones prove that for almost all A G T the critical point z = 1 of P\ belongs to the 
boundary of the Siegel disk (see, for example, [CG]). This has also been proved 
directly by M. Herman under the assumption that a is diophantine [He3]. M. 
Herman has also shown that there are A's for which the critical point is not on the 
boundary of the Siegel disk (even though the boundary is a quasicircle) [Do] . 
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4. Douady-Ghys' Theorem. Continued Fractions and 
the Brjuno Function 

From Yoccoz's theorem it follows that G\, A = e 27Tia , a G R \ Q, is a conjugacy 
class for almost all values of a. Let y denote the set of a G K. \ Q such that G e 2-*ia 
is a conjugacy class. Then we already know that y has full measure but that its 
complement in R \ Q is a G<s -dense (Exercise 1.18). The goal of this Section is to 
prove a result due to Douady and Ghys on the structure of y (Section 4.1) and to 
introduce various sets of irrational numbers (Sections 4.3 and 4.4) which have the 
same properties of y. Our main tool will be the use of continued fractions (see 
Appendix 2 for a short introduction). 

4.1 Douady-Ghys' Theorem 

We recall that SL (2, Z) is the group of matrices g = I , J with integer 

coefficients a, 6, c, d such that ad — be = 1. It acts onlU {oo} (thus on y too) as 
usual : g ■ a = 22±*. SL (2, Z) is generated by T = I J j J , T • a = a + 1, and 

t/= (l ~ 1 )^-« = - 1 /«- 

Further information on the structure of y is provided by the following 

Theorem 4.1 (Douady— Ghys) y is SL (2, Z) -invariant. 

Proof, (sketch), y is clearly invariant under T, thus we only need to show that if 
a G y then also U • a = —\ja G 3^- 

Let / G S e 2TTi a and consider a domain V' bounded by 

1) a segment / joining to Zq G D*, / C D ; 

2) its image f(l) ; 

3) a curve /' joining zo to f(zo). 

We choose /' and zq (sufficiently close to 0) so that /, I' and /(/) do not intersect 
except at their extremities. Note that / and /(/) form an angle of 2na at 0. 
Then glueing / to /(/) one obtains a topological manifold V with boundary 
which is homeomorphic to D. With the induced complex structure its interior is 
biholomorphic to D. Let us now consider the first return map gy to the domain V 
(this is well defined if z is choosen with \z\ small enough) : if z G V (and \z\is small 
enough) we define gv'( z ) — f n ( z ) where n (depends on z) is defined asking that 
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f(z), . . . , r^iz) <£ V and f n (z) e V, i.e. n = inf{k e N , k > 1 , f k (z) e V'}. 
Then it is easy to check that n = [— ] or n = [— ] + 1. The first return map gy> 
induces a map gy on a neighborhood of G V^ and finally a germ g of holomorphic 
diffeomorphism at G D (gp = pgy, where p is the projection from V to the disk 
O). It is easy to check that g(z) = e~ 2nt / a z + 0(z 2 ) (note that in the passage 
from V to D through V the angle 2-rra at the origin is mapped in 2n). 

To each orbit of / near corresponds an orbit of g near 0. In particular 

• / is linearizable if and only if g is linearizable ; 

• if / has a periodic orbit near then also g has a periodic orbit ; 

• if / has a point of instability (i.e. a point which does not belong to Kf) then 
also g has a point of instability (which, after having normalized g so as to be 
univalent on D, will leave the unit disk even more rapidly). 

In particular these statements show that a G y if and only if —1/a G y. □ 



4.2 SL(2,Z) and Continued Fractions 

To better understand the action of SL (2, Z) on R \ Q we can introduce a 
fundamental domain [0, 1) for one of the two generators (the translation T) and 
restrict our attention to the inversion a h-> 1/a restricted to [0, 1). This gives us 
a "microscope" since a h- > 1/a is expanding on [0, 1), i.e. its derivative is always 
greater than 1. Our microscope magnifies more and more asa^ 0+ and leads to 
the introduction of continued fractions discussed in Appendix A2. 

Exercise 4.2 Show that given any pair x, y G R \ Q there exists g G SL (2, Z) 
such that x = g ■ y if and only if x = [oo, a±, . . . ,a m , Co, Ci, . . .] and y = 
[&o, b±, . . . , b n , Co, ci, . . .]. [Hint : it is easy to check that the condition is sufficient 
for having x — g ■ y ; necessity is more tricky, see [HW] pp. 141-143.] 

Exercise 4.3 Show that if a; is a quadratic irrational, i.e. x G K \ Q is a zero of 
a monic quadratic polynomial equation with coefficients in Q, then there exists 
A^ G N such that the partial fractions a n of x are bounded a n < N for all n > 0. 

The two main results which make continued fractions so useful in the study of 
one-dimensional small divisors problems are the following 

Theorem 4.4 (Best approximation) Let x G K \ Q and let p n /q n denote its 
n-th convergent. If < q < q n +i then \qx — p\ > \q n x — p n \ for all p G Z and 
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equality can occur only if q = q n , p = p n . 



Theorem 4.5 If 



x-2 



< ^2 then - is a convergent of x. 



For the proofs see [HW], respectively Theorems 182, p. 151 and 184, p. 153. 

4.3 Classical Diophantine Conditions 

Let 7 > and r > be two real numbers. 



Definition 4.6 x G 1 \ Q is diophantine of exponent r and constant 7 if and 
only if for all p, q G Z, q > 0, one has x — - > 75 



-2-r 



Remark 4.7 Note that Theorem 4.5 implies that given any irrational number 



there are infinitely many solutions to 



x — 



< 4y with p and q coprime. This 



explains why the previous definition would never be satisfied if r < 0. 



x — 



> 79" 



-2-T 



We denote CD (7, r) the set of all irrationals x such that 

for all p, q G Z, q > 0. CD (r) will denote the union U 7 >oCD (7, r) and 

CD =U T > CD(r). 

Exercise 4.8 Show that 

CD (r) = {x G R \ Q I Q n+ i = 0(^ +T )} = {xGl\Q| a n+1 = 0(0} 



{iel\ 



x. 



og9-: 1 )} = {x6r\qi &- 1 = oG0-ir)} 



Exercise 4.9 Show that if a; is an algebraic number of degree n > 2, i.e. 
x G K \ Q is a zero of a monic polynomial with coefficients in Q and degree 
n, then x G CD (n — 2) (Liouville's theorem). Thue improved this result in 1909 
showing that x G CD (r — 1 + n/2) for all r > (see [ST], Chapter V, for a very 
nice discussion of the proof in the cubic case) . Actually one can prove that if x is 
algebraic then x G CD (r) for all r > regardless of the degree, but this is difficult 
(Roth's theorem). 

Exercise 4.10 Using the fact that the continued fraction of e = X^°° -L [ s 

[2,1,2,1,1,4,1,1,6,1,1,8,1,1,10,...] 
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show that e G n r >oCD(r). A proof of the continued fraction expansion of e, 
which is due to L. Euler, can be found in [L], Chapter V. Perhaps you may like to 
try to obtain it yourself starting from the knowledge of the continued fraction of 
f±i = [2,6,10,14,...]. 

Exercise 4.11 Use the result of Exercise 4.9 to exhibit explicit examples of 
trascendental numbers, e.g. x = Yl^=o 10 _n '- 

The complement in R \ Q of CD is called the set of Liouville numbers. 

Exercise 4.12 Show that CD (r) and CD are both SL (2,Z)-invariant. 

Proposition 4.13 For all 7 > and r > tie Lebesgue measure of 
CD (7, r) (modi) is at least 1 — 27^(1 + r), where ( denotes the Riemann zeta 
function. 

Proof. The complement of CD (7, r) (modi) is contained in 

(P -2-T P , -2-T 

whose Lebesgue measure is bounded by 

00 q 00 

g=l p=l q=l 

D 
From the point of view of dimension one has (see [Fa], p. 142 for a proof) 

Theorem 4.14 (Jarnik) Let r > and let F T be the set of real numbers x G [0, 1] 
such that {qx} < q~ l ~ T for infinitely many positive integers q. The Hausdorff 
dimension of F T is 2/(2 + r). 

Exercise 4.15 The set CD (0) is also called the set of numbers of constant type 
since x G CD (0) if and only if the sequence of its partial fractions is bounded. 
Show that CD (0) has Hausdorff dimension 1 and zero Lebesgue measure. 

Exercise 4.16 Show that the set of Liouville numbers has zero Lebesgue measure, 
zero Hausdorff dimension but it is a dense Gj-set 
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4.4 Brjuno Numbers and the Brjuno Function 

Let x E R\Q, let ( — J denote the sequence of its convergents and let (0 n )n>-i 

\Wn>0 

be denned as in (A2.14). 

Definition 4.17 x is a Brjuno number if B(x) := Y^=o fin-i^ogx^ 1 < +00. 
The function B : R \ Q — > (0, +00] is called the Brjuno function. 

Exercise 4.18 Show that all diophantine numbers are Brjuno numbers. 

Exercise 4.19 Show that there exists C > such taht for all Brjuno numbers x 
one has 

log q n + 1 



B{x) - £ 



n=0 



<c . 



Exercise 4.20 (see [MMY]) Show that the Brjuno function satisfies 

B(x) = B(x + l), VxGR\Q 

fl\ (4-1) 

B(x) = -\ogx + xB I - J , iel\Qn(0,l) 

Deduce from this that the set of Brjuno numbers is SL (2, Z)-invariant. Use the 
above given functional equation to compute B(x p ), where x p = — — ^ , p e N. 

Exercise 4.21 (see [MMY]) Show that the linear operator (Tf)(x) = xf (-), 

x E (0,1), acting on periodic functions which belong to L p ( T, ( 1+ ^f log2 ) nas 

spectral radius bounded by g = v ^ 5 2 ~ 1 . Conclude that the Brjuno function 
B E n p >iLP(T). Note that B <£ L°°(T). 

Exercise 4.22 Write the continued fraction expansion of a Brjuno number which 
is not a diophantine number. The same for the decimal expansion. Is Xl^Lo 10 -n ' 
a Brjuno number ? What about X^^Lo 10~ 10 ? 

Exercise 4.23 Let a > 0. Use the results of Exercise 4.21 to study the functions 

B ia \x) = B^\x + l) , VxGR\Q 

B (ff) (x) = i" 1/ff + iB W (-] , ieR\Qn(0,l) 
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Show that if B^ a \x) < +00 then x G CD (a). Viceversa, if x & CD (r) then 
B^ u \x) < +00 for all a > r. 
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5. Siegel-Brjuno Theorem, Yoccoz's Theorem and Some 
Open Problems 

Recall that y denotes the set of a 6 1 \ Q such that G e ^ia is a conjugacy 
class. Here we list what we already know about it 

• y has full measure (Chapter 3) ; 

• the complement of y in K. \ Q is a G^-dense (Exercise 1.18) ; 

• y is invariant under the action of SL (2, Z) (Douady-Ghys' Theorem, Chapter 

4). 

The purpose of this Chapter is to prove the classical results of Siegel [S] and 
Brjuno [Br] which show that : 

all Brjuno numbers belong to y 

Moreover we will state the Theorems of Yoccoz [Yo2] and in particular his 
celebrated result : 

y is equal to the set of Brjuno numbers. 

We will also mention several open problems. 

For the sake of brevity, starting with this section all the proofs will be only 
sketched : the reader can try autonomously to fill in the details but we will always 
refer to the original literature where complete proofs are given. 

5.1 Siegel-Brjuno Theorem 

The theorem of Siegel and Brjuno says that the set of Brjuno numbers is a subset 
of y. Indeed in 1942 C.L. Siegel was the first to show that y is not empty showing 
that CD C y. 

We will sketch the proof of a more precise result which follows from the 
Theorem of Yoccoz which we will discuss in the next section but which can also 
be proved following the classical majorant series method (see [CM]). Let us recall 
that S\ denotes the topological space of all germs of holomorphic diffeomorphisms 
/ : D -»■ C such that /(0) = 0, / is univalent on D and /'(0) = A. By Theorem 
Al.19 it is a compact space. Given a germ / G S\ we let r(f) indicate the radius 
of convergence of the linearization hf of / and we set 

J t O 2-Kia. 
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Theorem 5.1 (Yoccoz's lower bound) 

logr(a) > -B(a)-C (5.2) 

where C > is a universal constant (independent of ct) and B is the Brjuno 
function. 

Before of sketching the proof let us briefly mention what is the main difficulty 
which was first overcome by Siegel and which was clearly well-known among 
mathematicians at the end of the 19th and at the beginning of the 20th century 
(in 1919 Gaston Julia even claimed, in an incorrect paper, to disprove Siegel's 
theorem). 

Assume that ct G CD (r) for some r > 0. Recalling the recurrence (1.2) for 
the power series coefficients of the linearization h/(z) = Y^m=i h n z n 



1 






hi = 1 , h n = _ 2^ fj 2_^ h ni --- h nj n > 2 (5.3) 

j—2 ni + ...+rij=n 

one sees that h n is a polynomial in f 2 , . . . , f n with coefficients which are rational 
functions of A : h n G C(A)[/2, . . . , f n ] for all n > 2. 

Let us compute explicitely the first few terms of the recurrence 

h 2 = (\ 2 -\)- 1 f 2: 

/i 3 = (A 3 -A)- 1 [/3 + 2/|(A 2 -A)- 1 ], 

h 4 = (A 4 - A)- 1 ^ + 3/ 3 / 2 (A 2 - A)" 1 + 2/ 2 / 3 (A 3 - A)" 1 



(5.4) 



and so on. It is not difficult to see that among all contributes to h n there is always 
a term of the form 2 n - 2 / 2 n_1 [(A n - A) ... (A 3 - A) (A 2 - A)]" 1 . If one then tries to 
estimate \h n \ by simply summing up the absolute values of each contribution then 
one term will be 

2 n " 2 |/2r _1 [|A n -A| . . . |A 3 -A||A 2 -A|]" 1 < 2 n - 2 |/ 2 | n - 1 (2 7 )( n - 1 ) T [(n-l)!] T (5.5) 

if a G CD (7, r) and one obtains a divergent bound. Note the difference with the 
case |A| 7^ 1 : in this case the bound would be |A| - ( n-1 - ) 2 n-2 |/2| n-1 c n-1 for some 
positive constant c independent of /. Thus one must use a more subtle majorant 
series method. 
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The key point is that the estimate (5.5) is far too pessimistic : 

Exercise 5.2 Show that the series XmLi (\™-i) (a-i) ' w ^h A = e 27 " a , has 
positive radius of convergence whenever limsup n ^ 00 og9fc+1 < +00. [Hint : see 
[HL] for a proof.] 

Indeed when a small divisor is really small then, for a certain time, all other small 
divisors cannot be too small. This vague idea is made clear by the two following 
lemmas of A.M. Davie [Da] which extend and improve previous results of A.D. 
Brjuno. 

Let x G K, x 7^ 1/2, we denote \\x\\z — min pe ^ \x + p\. 

Lemma 5.3 Let a€l\Q, iPj/lj)j>o denote the sequence of its convergents, 
k G N, n G N, n 7^ 0, and assume that \\na\\z < l/(4(/fc)- Then n > qk and either 
qk divides n or n> q k +i/4:. 

Proof. From Theorem 4.5 it follows that if r is an integer and < r < q k then 
|| rQ; ||z > (2Qfc) _1 - Thus n > qk- Assume that qk does not divide n and that 
n < qk+i/4. Then n = mqk + r where < r < qk and m < qk+i / {Aqk) ■ Since 
||<?fca|U < Qk+i one § ets \\ m( lka\\z < mq^ < (Aqk)' 1 . But ||ra||z > {^qk)' 1 thus 
\\na\\ z > {Aqk)- 1 . D 

Using this information on the sequence (||no;||z)n>o Davie shows the following : 
Let A k = |n > | ||na|| < g^j, E k = max (q k , ?fc+i/4) and n k = qk/E k . Let A* k 
be the set of non negative integers j such that either j G Ak or for some j\ and j'2 
in Ak, with j 2 — ji < Ek, one has ji < j < J2 and ^ divides j — j±. For any non 
negative integer n define : 

/ (n) = max \ (1 + i] k ) 2, (m n r]k + n) 1 > (5.6) 

I Qk Qk J 

where m n = max-jj | < j < n, j G A£}. We then define a function /ijt : N — > K+ 
as follows 

M " ) = W) if^+ajTA; <5 ' 7) 

The function /i^ (n) has some properties collected in the following proposition 
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Proposition 5.4 The function hk (n) verifies 
(!) (1±J> _ 2 < h k (n) < ^^ - 1 for all n. 

(2) lfn>0 and n G A* k then h k (n) > h k (n - 1) + 1. 

(3) hk (n) > hk (n — 1) for all n > 0. 

(4) hk (n + qk) > hk (n) + 1 for all n. 

Now we set gk (n) = max ( hk (n) , — j and we state the following proposition 

Proposition 5.5 The function gk is non negative and verifies : 

(1) g k (0)=0; 

(2) g k (n) < L}± ^ for all n ; 

(3) g k (ni) + g k M < gk {n\ + n 2 ) for all m and n 2 ; 

(4) ifn G Ak and n > then gk (n) > g k (n — 1) + 1. 

The proof of these propositions can be found in [Da] . 

Let k(n) be defined by the condition qk( n ) < n < qk( n )+i- Note that k is 
non-decreasi ng . 



Lemma 5.6 (Davie's lemma) Let 



fc(n) 



K(n) =nlog2 + ^2 9k(n) log(2g fc+ i) . 



fc=0 



The function K (n) verifies : 
(a) There exists a universal constant Co > such that 



fe(n) 



log qk+i 



K{n) <n\J j V c 



fc=0 



qk 



(b) K(m) + K{ri2) < if (rii + 712) for all n\ and n 2 ; 

(c) -log|A n -l| < K(n)-K(n-1). 



(5.8) 



(5.9) 



Proof. We will apply Proposition 5.4. By (2) we have 



K{n) < n 



< n 



log2 + ^ii±^log(2, fc+1 



&=0 



fe(n) 00 / 1 . N 00 , 

vl^±i + log2 + log2 vfl + ^ + 4Vi^±i 
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since r] k < Aq k q k ^ v 

By Remark A2.4 the series Yl ogqk+1 and Yl Qk are uniformly bounded by 
some constant independent of a, thus (a) follows. 

(b) is an immediate consequence of Proposition 5.5, (3), and the fact that 
k(n) is not decreasing. 

Finally recall that 

— log |A n — 1| = — log 2 1 sin7ma| G (— log7r||nct||z, — log2||na||^) . 

For all n we have either ||na||z > 1/4 or there exists some non-negative integer 
k such that (8(/fc) _1 > ||na||z > (8(/fc-|_i) _1 , thus n G A k , which implies by 
Proposition 5.5, (4), that gk(n) > gk{n — 1) + 1, and — log|A n — 1| < 2q k +i. 
Combining these facts together one gets (c). □ 

Following [CM] we can now prove Theorem 5.1. 

Proof, of Theorem 5.1. Let s (z) = Y n >i s n zU be the unique solution analytic 

at z = of the equation s (z) = z + a (s (z)) : where a(z) = tj^jJt = Y n >2 nzU • 
The coefficients satisfy 

n 
Si = 1 , S n = ^ m E S ni ...S nm . (5.10) 

m=2 ni+...+n m =n , n^>l 

Clearly there exist two positive constants ci, c^ such that 

\ s n\ S c l c 2 • 

From the recurrence relation and Bieberbach-De Branges's bound \f n \ < n for all 
n > 2 we obtain 



1 n 



\h n \ < — — ■ > m > \h n , I . . . \h n I . 

— \ n A J J 

m=2 ni+...-\-n rrl =n , rii>l 

We now deduce by induction on n that \h n \ < s n e K ^ n_1 - ) for n > 1, where 
K : N — *■ R is defined in (5.8). If we assume this holds for all n' < n then 
the above inequality gives 

\ hn \ < ^L^ y m y Sni ... Sn e *(»i-D+-*(»™-i) . 

m=2 ni + ...+n m =n , rii>l 
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But K(m - 1) + . . . K(n m - 1) < K(n - 2) < K(n - 1) + log |A n - A| and we 
deduce that 

n 

as required. Theorem 5.1 then follows from the fact that n~ 1 K(n) < B(u) + cq 
for some universal constant cq > (Davie's lemma). □ 

Exercise 5.7 Consider the quadratic polynomial P\(z) = X(z — z 2 ) (we have 
conjugated (3.1) by an homothehty so as to eliminate a factor 1/2 in what follows). 
Its formal linearization H\(z) = Xl^Li H n (X)z n is given by the recurrence 

#i(A) = l, H n (X) = (1 - A"" 1 )" 1 J2 h ^ x ) h j( x )- 

Define the sequence of positive real numbers (h n (X)) n >i by the recurrence 

/H(A) = 1, h n (X) = \l-X n - 1 \- 1 J2 hi( x )hj(X). 

i-\-j=n 

Clearly \H n (X)\ < h n (X). Show that if A = e 2nta , a G K\Q is not a Brjuno number 
then limsup n ^ 00 n _1 log/i n (A) = +oo, i.e. the majorant series Y^ n >i hn(X)z n 
is divergent. Under what assumptions on a one can show that there exist two 
positive constants co,ci and s > such that h n (X) < coc™(n!) s for all n > 1? 
[Hint : First show that the set {n > | q n +i > (g n + l) 2 } is infinite and 
denote (^)i>o its elements. Then show that the sequence h n (X) is increasing and 

/y i+ i(A) > |1 — A^+i | — 1 (^ (? ^ + i(A)) L^^^J . One can also consult [Yo2, Appendice 

2, pp. 83-85] and [CM].] 

Exercise 5.8 (Linearization and Gevrey classes, see [CM] for solutions and more 
information. ) Between C[[z]] and C{z} one has many important algebras of 
"ultradifferentiable" power series (i.e. asymptotic expansions at z = of functions 
which are "between" C°° and C{z}). Consider two subalgebras A\ C A 2 of zC [[z]] 
closed with respect to the composition of formal series. For example Gevrey-s 
classes, s > (i.e. series F(z) = J2 n>0 f n z n such that there exist ci, C2 > such 
that \f n \< cic£(n!) s for all n > 0). Let / G A x being such that /' (0) = A G C*. 
We say that / is linearizable in A 2 if there exists hf G A 2 tangent to the identity 
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and such that / o hf = hf o R\. Show that if one requires A<i = A\, i.e. the 
linearization hf to be as regular as the given germ /, once again the Brjuno 
condition is sufficient. It is quite interesting to notice that given any algebra of 
formal power series which is closed under composition (as it should if one whishes 
to study conjugacy problems) a germ in the algebra is linearizable in the same 
algebra if the Brjuno condition is satisfied. If the linearization is allowed to be 
less regular than the given germ (i.e. A\ is a proper subset of A?) one finds new 
arithmetical conditions, weaker than the Brjuno condition. Let (M n ) n >i be a 
sequence of positive real numbers such that : 

0. inf n >! Mn /n > ; 

1. There exists C\ > such that M n+1 < C™ +1 M n for all n > 1 ; 

2. The sequence (M n ) n >i is logarithmically convex; 

3. M n M m < Mm+n-i for all m, n > 1. 

Let / = Xm>i f nZ,n e z< ^ [i z ]] ' / De l° n g s to the algebra zC [[z]], M \ if there exist 
two positive constants c\ , c^ such that 

|/n| < c x c^M n for all n > 1 . 



Show that the condition 3 above implies that zC [[z]], M s is closed for composition. 

J(iV„)> 



Show that if / E zC [[z]] (N \, fi = e 27ria and a verifies 



/^A logg fc+ i 1 M n \ 
Iim sup > log —— < +oo 

n^+oo \^ Q Qk n N n J 

where k(n) is defined by the condition qk( n ) < n < qk(n)+ii then the linearization 
hf G zC [[z]], M y (We of course assume that the sequence (iV n ) n >o is asymp- 
totically bounded by the sequence (M n ), i.e. M n > N n for all sufficiently large 
n). 



5.2 Yoccoz's Theorem 

The main result of Yoccoz can be very simply stated as 

y = {a e R \ Q | B(a) < +00} = Brjuno numbers 
but he proves much more than the above : 
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Theorem 5.9 

(a) If B(a) = +00 there exists a non-linearizable germ f G S e i-*i<x ; 

(b) If B(a) < +00 then r(a) > and 

\ log r (a) + B(a)\ <C, (5.11) 

where C is a universal constant (i.e. independent of a) ; 

(c) For all e > there exists C e > such that for all Brjuno numbers a one has 

-B(a) - C < logr(P e 2^ a ) < -(1 - e)B(a) + C e (5.12) 

where C is a universal constant (i.e. independent of a and e. 

The remarkable consequence of (5.11) and (5.12) is that the Brjuno function not 
only identifies the set y but also gives a rather precise estimate of the size of 
the Siegel disks. When ct is not a Brjuno number the problem of a complete 
classification of the conjugacy classes of germs in £^2™ is open and quite difficult 
(perhaps unreasonable) as the following result of Yoccoz shows : 

Theorem 5.10 Let a 6 1 \ Q, B(a) = +00. There exists a set with the power 
of the continuum of conjugacy classes of germs ofG^^ia, each of which does not 
contain an entire function. 

The proof of the Theorem of Yoccoz (5.9 above) uses a method, invented by 
Yoccoz himself, known as geometric renormalization. Roughly speaking it is a 
quantitative version of the topological construction of Douady-Ghys described in 
Chapter 4 and which shows that the set y is SL (2, Z)-invariant. Whereas the 
construction of non-linearizable germs / G G e 2». a when B(a) = +00 and Yoccoz's 
upper bound 

logr(a) < C -B(a) 

go far beyond the scope of these lectures, it is not too difficult to give an idea of 
how Yoccoz proves Theorem 5.1, i.e. the lower bound 

logr(a) > -C-B(a) . 

Let / G S e 2*i a and let E : M —> D* be the exponential map E(z) = e 2iriz . Then / 
lifts to a map F : HI — ► C such that 

EoF = foE , (5.13) 

F : U -»■ C is univalent , (5.14) 

F(z) = z + a + <f(z) where <p isZ periodic and lim <p(z) = (5.15) 
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We will denote S(a) the space of univalent functions F verifying (5.13), (5.14) and 
(5.15). 

Exercise 5.11 Show that S(a) is compact and that it is the universal cover of 
Exercise 5.12 Show that if / G Si then one has the following distorsion estimate : 






2\z\ 
< — ^forall^GD. (5.16) 



Exercise 5.13 Use the result of the previous exercise to show that if ip is as in 

(5.15) and z eU then 

1 — exp(— 2tx Xsmz) 
\<p(z)\ < — log(l-exp(-27r9fmz)) . (5.18) 

TV 

Let r > 0, M. r = M. + ir. It is clear that if F G S(a) and r is sufficiently large 
then F is very close to the translation z \-^ z + a for z G M. r . Indeed using the 
compactness of S(a) and Exercise 5.13 one can prove the following : 

Exercise 5.14 Let a^O. Show that there exists a universal constant cq > (i.e. 
independent of a) such that for all F G S(a) and for all z G H t ( a ) where 

I 

t(a) = — log a" 1 + c , (5.19) 

one has 

ct 
|F(z)-z-a| < - . (5.20) 

[Hint : Let (p(z) = F(z) - z - a = E^i^e 2 ™ 2 . If Smz > t(a) then 
\F(z) -a-z\< J2n=i \¥>n\a n e- 2linco thus . . ..] 

Given F, the lowest admissible value t(F, a) of t{a) such that (5.20) holds for all 
z G HIt(F,a) represents the height in the upper half plane EI at which the strong 
nonlinearities of F manifest themselves. When Qmz > t(F, a), F is very close to 
the translation z ^ z + a. An example of strong nonlinearity is of course a fixed 
point : if F(z) = z + a + ^- i e 2ntz , a > 0, then z = -\ + ^ log(27ra) _1 is fixed 
and t(F, a) > ± log^a)" 1 . 
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The estimates (5.19) and (5.20) are the fundamental ingredient of Yoccoz's 
proof of the lower bound (5.2) together with Proposition 2.15 and the following 
elementary properties of the conformal capacity. 

Exercise 5.15 Let U C C be a simply connected open set, U ^ C, and let zq G U. 
Let d be the distance of z$ from the complement of U in C and let C(U, z$) denote 
the conformal capacity of U w.r.t. zq. Then 

d < C(U, z ) < Ad . (5.21) 

As in the proof of Douady-Ghys theorem we can now construct the first return map 
in the strip B delimited by / = [it(a), +ioo[, F(l) and the segment [it(a), F(it(a))]. 
Given z in B we can iterate F until ?R,eF n (z) > 1. If 3mz > t(a) + c for some 
c > then z' = F n (z) — 1 E B and z i— »■ z' is the first return map in the strip 
B. Glueing / and F(l) by F one obtains a Riemann surface S corresponding to 
hit 5 and biholomorphic to D*. This induces a map g G S^i/a which lifts to 
G G S(a~ 1 ). One can then show the following (see [Yo2], pp. 32-33) 

Proposition 5.16 Let a G (0, 1), F G S(a) and t(a) > such that if^sm z > t(ot) 
then \F(z)— z— a\ < a/4. There exists G G Si^a -1 ) such that if z G H, Qmz > t(a) 
and F { (z) G H for all i = 0, 1, . . . , n - 1 but F n (z) £ B. then there exists z' G C 
such that 

1. Qmz' > a~ 1 (^smz — t(a) — ci), where c\ > is a universal constant ; 

2. There exists an integer m such that < m < n and G m (z') (jL H. 

From this Proposition one can conclude the proof as follows. Let us recall that 
Kf = n n >o/ _n (D) is the maximal compact /-invariant set containing 0. Let 
F G S(a) be the lift of / G S e 2^ ia and let Kp C C be defined as the cover of Kf : 
Kp = E -1 ^^. It is immediate to check that 

dp = sup{3mz | zeC\K F } = logdist (0,C \ Kf) . 

Thus by (5.21) one gets 

exp(-27rrf F ) < C(K f ,0) < 4ex.p(-2ird F ) . 
Theorem 5.1 is therefore equivalent to the lower bound 

sup d F <— B(a)+C (5.22) 

FES(a) 27T 

36 



for some universal constant C > 0. 

Assume that (5.22) is not true and that there exist a 6 1 \ Q fl (0, 1/2) with 
B(a) < +oo , F G S(a), z£i and n > such that 

SmF n (z) <0, 

3mz> — B(a) + C . 

Let us choose a, F and z so that n is as small as possible. By Proposition 
5.16, if C > Co, one gets 

^smz' > a~ 1 \^smz — t(a) — c±] 



>a~ 1 



— (B(a) -log a 1 ) + C-c -c 1 

Ztv 



By the functional equation of B one gets 

Smz' > — S(a _1 ) + a _1 [C-co-ci] > — S(a _1 ) + C 

provided that C > 2(co + ci). But Proposition 5.16 shows that this contradicts 
the minimality of n and we must therefore conclude that (5.22) holds. □ 

A nice description of the proof of the upper bound log r(a) < C — B(a) can be 
found in the Bourbaki seminar of Ricardo Perez-Marco [PM1] . 



5.3 Some Open Problems 

The first open problem we want to address is whether or not the infimum in (5.1) 
is attained by the quadratic polynomial P\(z) — Xz (l — |) : 

Question 5.17 Does r(a) = inf/gs 2nia r(f) = r2(e 27 ™ a ), i.e. the radius of 
convergence of the quadratic polynomial ? 

If this were true then the very precise bound (5.11) would hold also for P\ : 
recalling that r2(A) = |w(A)|, where u : D — > C is the function defined in Section 
3.1, one can ask 

Question 5.18 Does the function a h-> log \u(e 2iria )\ + B(a) E L°°(§ 1 ) ? 

Indeed there is a good numerical evidence that much more could be true : 
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Conjecture 5.19 The function a i— >• log \u(e 2lTlOL )\ +B(a) extends to a Holder 1/2 
function. 

We refer to [Mai] and to [MMY2] and references therein for a discussion of 
Conjecture 5.19. The next Exercises give an idea of how to compute approximately 
but effectively the function a ^ log |-u(e 27 " a )| on a computer. More informations 
can be found in [He4] (where one can also find many problems, most of which are 
still open) and [Mai]. 

Exercise 5.20 Let / G G\ be linearizable, A = e 27Zia . Let Uf be the Siegel 
disk of /, hf be the linearization of /, z G Uf, z = hf(w), where w G D r (/) 5 
\w\ = r < r(f). Show that 



m— 1 



_. m— 1 

lim -^log|i^)| = logr (5.23) 



j=o 



[Solution : Since hf conjugates / to R\ one has fi(z) = fi(hf(w)) = hf(X'w) for 
all j > and w G D r (/); thus 



m—l _, m — l 



-. I I v — J. j ( f </ _1_ 

- ^log|^(^)| = — ^log|^(A^)|. 

n ^— » rn ^— ' 



m ' — ' m 

J=0 J=0 



/i/ has neither poles nor zeros but w = thus by the mean property of harmonic 
functions one has L log \hf{re 2 ' K ' LX )\dx = logr for all r < r(f). Finally note that 
w i— > Aiu is uniquely ergodic on |tu| = r, and in this case Birkhoff 's ergodic theorem 
holds for all initial points, thus 



m— 1 _. m—l 



lim — V \og\f 3 (z)\ = lim — V log \h f (X J w)\ 
i=o i=o 

= / log|/i/(re 27rjx )|(ix = logr . 

Exercise 5.21 Deduce from the previous exercise that for almost every z G dUf 
with respect to the harmonic measure one has 

_. m — l 

lim -y"log|/ J '(z)| = logr(/). (5.24) 
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Let us now consider the quadratic polynomial P\ once more. According to (5.24) 
to compute |it(A)| one needs to know that some point belongs to the boundary of 
the Siegel disk of P\ (and hope . . .). The critical point cannot be contained in Up x 
because f\u P is injective, and from the classical theory of Fatou and Julia one 
knows that dUp x is contained in the closure of the forward orbit {Pj-(l) | k > 0} 
of the critical point z — 1. Finally Herman proved if a verifies an arithmetical 
condition 7i, weaker than the Diophantine condition but stronger than the Brjuno 
condition (see, for example, [Yol] for its precise formulation) the critical point 
belongs to dUp x . 

Exercise 5.22 If a G CD (0) Herman has also proved that dUp x is a quasicircle, 
that is the image of S 1 under a quasiconformal homeomorphism. In this case 
hp x admits a quasiconformal extension to \w\ = r2(X) and therefore is Holder 
continuous [Po] : 

\hp x (w!) - h Px (w 2 )\ < 4\ Wl - w 2 \ 1 ~ x (5.25) 

for all W\ ,W2 G 9D r2( - A ), where x G [0, 1[ depends on A is the so-called Grunsky 
norm [Po] associated with the univalent function g(z) = r2(X)/hp x (r2(\)/z) on 
\z\ > 1. Using this information show that 

i <?fe-i o 2ir 

|- J2 log\Pl(z)\ - logr 2 (A)| < - 7 tt(-) 1 - x , (5-26) 

where Pk/Qk is a convergent of the continued fraction expansion of a. Note that 
(5.26) implies convergence to logr2(A) for all z G dUp x , thus also for the critical 
point z = 1. 
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6. Small divisors and loss of differentiability 

In this Chapter we will (very !) briefly illustrate other two completely different ap- 
proaches to the problem of linearization of germs of holomorphic diffeomorphisms 
with an indifferent fixed point. 

In the previous chapter we saw how the optimal sufficient condition can be 
obtained by the classical majorant series method as Siegel and Brjuno did and 
that Yoccoz was able to show that it is also necessary with his ingenious creation 
of "geometric renormalization" . Here we will give an idea of two proofs of the 
Siegel theorem, one due to Herman [Hel, He2] and the other due essentially to 
Kolmogorov [K] (see also Arnol'd [Ar3] and Zehnder [Ze2]). 

Herman's method is far from giving the optimal number-theoretical condition, 
the idea is simply so original and beautiful that it deserves being known. It also 
illustrates how in one-dimensional small divisor problems the problem known as 
"loss of differentiability" does not prevent from the application of simple tools 
like the contraction principle. Herman's method can also be extended to (and it 
is actually described by him for) the problem of local conjugacy to rotation of 
smooth orientation-preserving diffeomorphisms of the circle. 

The idea of Kolmogorov is do adapt Newton's method for finding the roots 
of algebraic equations so as to apply it for finding the solution of the conjugacy 
equation. This method has been shown by Riissmann [Rii] to be adaptable so as 
to prove the sufficiency of a condition (slightly stronger than) Brjuno's. The main 
reason for sketching Kolmogorov's argument in this rather limited setting is that in 
the second part of this monograph we will illustrate Nash-Moser's implicit function 
theorem which is essentially the abstract and flexible formulation of Kolmogorov's 
idea. 



6.1 Hardy-Sobolev spaces and loss of differentiability 

Let k G N, r > 0. Following [He2], we introduce the Hardy-Sobolev spaces 

oo oo 

O k r > 2 = {f(z) = J2fnZ n I \\f\\ okr ,* = (\fo\ 2 + J2 n2k \fn\ 2 r 2n ) 1/2 <+oo}. (6.1) 

n=0 n=l 

Exercise 6.1 

(a) Show that O^ 2 is a Hilbert space. 
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(b) Show if / 6 C>*' 2 , /(0) = 0, one has 



\f(z)\ < VP) 



sup \j{z)\ s VU^;il/llo fc ' 2 > 

|z|<r 

where ( denotes Riemann's zeta function. 

(c) Show that if k > 1 then C^> 2 is a Banach algebra. 

(d) Show that if / G O k ' 2 and is holomorphic in a neighborhood of /(D r ) then 
4> o / G C^?' 2 and on a sufficiently small neighborhood V of / in O k ' 2 the map 
^67t->^oi/)e O k ' 2 is holomorphic. 

The following very elementary proposition well illustrates the phenomenon of "loss 
of differentiability" due to the small divisors which already arises at the level of 
the linearized conjugacy equation (6.2). 

Proposition 6.2 Let < r < r , r G N, k > 1 + r , / G C^' 2 , /(0) = 0. If 
a G CD(t) then the unique solution g := D^ 1 f verifying g(0) = of 

goR x -g = f, (6.2) 

belongs to O l ^~ 1 ~ To ' 2 . Moreover there exists a universal constant C > such that 



d k - 1 - To g. l C..d k f, 



n ' 2 — — II — i — II n ' 2 



dz Ur 7 dz 
where 7 = inf n >i n 1+r |A n — 1|. 

Proof. It is a straightforward computation starting from the identity g(z) = 

E^l(A n -l)-7n- □ 

This Proposition shows that solving the linear equation (6.2) the small divisors 
cause the loss of 1 + tq derivatives. This loss of differentiability phenomenon 
is typical of small divisors problems and it will be crucial in the discussions 
in the second part of this monograph. The most annoying consequence of this 
phenomenon is the impossibility of using fixed points methods to solve conjugacy 
equations, simply because the operator D\ is unbounded if regarded on a fixed 
Hardy-Sobolev space. However under some restriction on t$ one can actually use 
the contraction principle to solve the conjugacy problem, thanks to an ingenious 
idea of Herman we will shortly describe in the next section. 
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6.2 Herman's Schwarzian derivative trick 

Let Obea region in the complex plane and / : O — > C be holomorphic. 

Definition 6.3 The Schwarzian derivative S(f) of f is 

s(f) := (log/')" - ^((iogf )T = Y~l(y) 2 



■2 V/' 



1 



T 



(6.3) 



Exercise 6.4 

(a) Prove that the following "chain rule" holds : S(f og) = (S(f) °g)(g') 2 + S(g). 

(b) Show that S(f) = if and only if / is a Mobius map. 

The idea of Herman is to apply the Schwarzian derivative to the conjugacy equation 
foh — ho R\ : one obtains 

X 2 (S(h) o R x ) - S(h) = (S(f) o h){h'f . (6.4) 

At the r.h.s. appears b! , thus one has already lost one derivative and this does not 
seem to lead to anything good. Assuming the r.h.s. as given one could solve for 
S(h) but this would cost 1 + tq derivatives according to Proposition 6.2. However 
if To = 1 (which is true for almost all a as we saw in Section 4.2) the total loss 
of derivatives is three. The idea now is that applying S~ x one should recuperate 
three derivatives and this would imply that the map 

O k r ' 2 3h~ S-'D-'KSif) o h){ti) 2 ] (6.5) 

takes its values in C^> 2 too. Note that we have slightly modified the definition of 
D\ with respect to the previous Section : here D\f = X 2 f o R x — f. This does 
not change the conclusions of Section 6.1. 

On a disk D r of sufficiently small radius / is close to R\ thus S(f) must be 
small and one can hope to conclude using a fixed point theorem (the contraction 
principle, say). This strategy indeed works : see [Hel, He2] for the details. 

The inversion of S is achieved as follows. First of all note that it is not 
restrictive to assume /"(0) = 0, so that h"(0) — (one can preliminary conjugate 
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/ by the polynomial z — wx~W) z2 ) '■ this implies [(S(f) o h)(h') 2 ] z=0 = 0. Let 
ip = D x [(S(f) o h)(h') 2 ]. If ip is small enough (this is always the case if one 
considers a sufficiently small disk) then one can write if) uniquely in the form 

1> = V>i - \l>{ 

with -01 (0) = 0. Now one can easily solve the problem S(h\) = ip = ip[ — |Vi J us t 
looking for h\ such that (log/i^)' = ip\. This is achieved in three steps : 

ip' 2 = -01 , 

ipa = e c+ ^ 2 where c is chosen s.t. ^(O) = 1 , 



hi = / M()d( ■ 
Jo 

Then it is immediate to check that S(h\) = ip. 

Exercise 6.5 Let r > 0, ip and h\ as above. Show that if ip E O^ 2 then h\ E Of' 2 . 

6.3 Kolmogorov's modified Newton method 

Here we follow quite closely [St], Volume II, Chapter III, Section 7. We suggest 
however the reader to look also at [Ze2] for a complete proof. 

Let / G S\, A = e 27ria and assume that a is a diophantine number. We want 
to construct h tangent to the identity such that R\ — h~ x o f oh = 0. Let h = h — id 
and let us define the composition law as 

(id + /n)o(id + /i 2 ) =id + h 1 Qh 2 ■ (6.6) 

Clearly one expects that 

hiQ h 2 = hi + h.2 + quadratic terms . 

Let f — f — R\ and let us define a second composition law ® as 

R\ + h ® f = (id + h)- 1 o (R x + f) o (id + h) . (6.7) 



Of course one needs h be small so as to assure the existence of the inverse in 
(6.7) but this is not difficult to obtain considering a small enough disk since 

?2 ; 
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h = h 2 z 2 + 



Exercise 6.6 Recall Lagrange's Theorem on the inversion of analytic functions 
(see [Di], p. 250) : if h : D r — ► C is holomorphic and tangent to the identity then 
choosing r small enough there exists a unique solution z = k(w) of the equation 
w = h(z). Moreover k is holomorphic in a neighborhood of and is explicitly 
given by 

«(«,) = y y -^~- — T (h( w )) n . 

' ^ n\ dw 71 ' 1 v " 

n=l 

Get some precise estimate on the size of the domain and of the norm of k if h 
belongs to some Hardy-Sobolev space. 

We then define 

TZ(h, /) = (id + h) o R x - (R x + /) o (id + h) . (6.8) 

Clearly ^(0, 0) = 0, 7^(0, /) = -/ and 

n(hi h 2 , f) = (id + In) o (id + ~h 2 ) °R\- (Rx + f) o (id + In) o (id + ~h 2 ) (6.9) 
K(h 2 , h ® /) = (id + h 2 ) o R x - (id + h)- 1 o (R x + f) o (id + In) o (id + fftXO) 

Comparing (6.9) with (6.10) we have (for z small enough) 

(inf 11+^1)17^2, /ii<g>/)(z)| < \n(hiQ~h 2 J)(z)\ < (sup 11+^1)1^2,^®/)^)! , 

thus one should get 

C- 1 \\K(h2,h 1 ®f)\\ < \\n(InQh 2 J)\\ <C||7e(^ 2 ,^i(8)/)|| . (6.11) 

for a suitably chosen norm || || and some C > (see Exercise 6.6). 

Let us now try to solve the equation 1Z(h, f) = by taking a sequence of 
approximations defined as follows : 

(0) Let h = g = 0, fo = f- thus TZ(h , f ) = -f ; 

(1) Let fi = go<S> fo = fo- Choose g± to be the solution of the linearized equation 

<W(o, o)$i + d 2 n(o, o) A = o , (6.12)! 

where dj denotes the partial derivative w.r.t. the j-th argument. Finally we 
set hi — ho © gi- 
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(i+1) We choose /j+i = gi® fi and gi+i to be the solution of 

d 1 TZ(0, 0)g i+1 + d 2 ft(0, 0)/ i+1 = , (6.12) i+1 

and we set hi+i = hi jji+i. 

It is immediate to check that the linearized equations (6.12)j have the form 

gioR x - R x ogi = f t , (6.13) 

which we studied in Section 6.2. Note that we do not linearize at the point (0, /) 
since we would get a difference equation without constant coefficients : 

cWO, £_!)& + d 2 Tl(0, fi-Jfi = g i oR x -R x og i - ?._& = . 

If one could solve (6.13) at each step with a bound 

ll&II^CH/ill, (6-14) 

by (6.11) and (6.14) one would have 

||^iJi)||<sup||d 2 7i||(||5 i || 2 + ||/ i || 2 <C' 3 ||^ i _i,/ i _i)|| 2 . (6.15) 

This would imply the convergence of the iterative scheme to a solution oflZ(h, f) = 
provided that one chooses ||/|| small enough (i.e. one considers the restriction 
of / to a small enough disk D r ) : indeed iterating (6.15) one gets 

WnCgiJM < (C 3/2 \\n(~g J )\\f 

thus again by (6.11) one has 

\\n(hi,f)\\ = \\K(g 910... QgiJ)\\ 
<C\\'R(g 1 O...Qg l ,go®f)\\ 
<C7* ||^,, /OH 
<(C 2 ||^o,/o)||) 21 

Exercise 6.7 Assuming the estimates above show that the sequence h n converges 
thus by continuity of 1Z one gets the desired result. 
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Exercise 6.8 Use the above scheme to give an alternative proof of Koenigs- 
Poincare theorem. 

The above discussion shows how to prove the existence of the linearization 
disregarding the problem of loss of differentiability due to small divisors. This 
makes impossible to get an estimate like (6.14) unless one regularizes the r.h.s.. 
The simplest method of regolarization, which is adapted to the analytic case, is to 
consider restrictions of the domains : 

Exercise 6.9 Show that if / e C°' 2 , /(0) = 0, k e N, for all 5 > one has 

"%; 2 -* - G ) e_fc|l/l1 ^ 2 - (6 - 16) 

Combining the above given discussion with a suitable choice of restrictions (i.e. 
a sequence (5 n ) n >o such that Y^=o^ n < +°°) ° ne can indeed prove Siegel's 
Theorem following the iteration method. 
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Part II. Implicit Function Theorems and KAM Theory 



7. Hamiltonian Systems and Integrable Systems 

In this Chapter we will very briefly recall some well-known facts on symplectic 
manifolds and Hamiltonian systems. Very good references are [AKN] and [AM]. 



7.1 Symplectic Manifolds and Hamiltonian Systems 

Definition 7.1 A C°° symplectic manifold is a 2l-dimensional C°° manifold M 
equipped with a non-degenerate C°° two-form, (the symplectic form) to. A C°° map 
f : U — ► M' where U C M is open and M' is also symplectic (with symplectic 
form uj') is symplectic (or canonical,) if f*u' = to. 

The simplest (but important) examples of symplectic manifolds are : 

• M = M. 21 3 (pi, . . . ,pi, qi, . . . , qi), uj = J2 i=1 dpi A dqi (standard symplectic 
structure). If U and V are two open sets in M? 1 and / : U — » V then / is 
symplectic if and only if its Jacobian matrix J t G Sp (I, R), the Lie group of 

• M = T*N where N is a C°° Riemannian manifold. This is the typical situation 
in classical mechanics. If (qi, . . . , qi) are local coordinates in A^ and (pi, . . . , pi) 
are the corresponding local coordinates in the cotangent space at a point, then 

U = Y.i=l d Pi A %• 

• M = T 2 \uJ = Y! ^=1 de ^ ^de ^+l . 

Theorem 7.2 (Darboux) Each symplectic manifold M has an atlas (U a , f a )ceeA 
such that on <fi a (U a ) C M. 21 one has uj = </?* J2 i=1 dpiAdqi (the standard symplectic 
structure on M 21 ). The transition maps cp a o <pZ are symplectic diffeomorphisms, 
i.e. their Jacobians J a p(x) G Sp (/, R) for all x G <pp(U a fl Up). 

The atlas given by Darboux's Theorem and the corresponding local coordinates 
are called symplectic. 
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Definition 7.3 A Hamiltonian function on a symplectic manifold (M, uj) is a 
function H E C°°(M, R). The Hamiltonian vector field associated to H is the 
unique X H E C°°(M, TM) such that i Xh oj = dH. 

Note that in symplectic local coordinates a Hamiltonian vector field takes the form 



i=l 



dqi dpi dpi dqi 



and the associated ordinary differential equations are the classical Hamilton's 

equations of the motion of a conservative mechanical system with / degrees of 

freedom : 

dH dH . , 

dqi dpi 

Clearly the Hamiltonian function is a first integral of (7.2). The coordinates qi 
are also called "generalized coordinates" and the Pi their "conjugate momenta" . 
In many problems arising from celestial mechanics the flow is not complete due to 
the unavoidable occur ance of collisions, but we will always assume completeness 
of the Hamiltonian flow. 

Definition 7.4 The Poisson bracket of two functions f,g E C°°(M, R) defined 
on an open subset of (M,u>) is {F, G} := XqF = uj(Xf,Xg) = —XpG , thus 
X{f,g} — ~ [Xf,Xg] ■ Two functions F,G are in involution if {F,G} = 0, i. e. 
when their hamiltonian flows commute. 

Exercise 7.5 Show that the Hamiltonian flow $ : K. x M — ► M is symplectic : 
for alH E R one has $(£, -)*oj = uj. [Hint : use Cartan's formula ^| i=0 ^ , (^ O* 1 ^ = 
d{ix H oj) + ix H du), where Xh = Jj|t=o^'(^') is the Hamiltonian vector field 
associated to $.] 

The importance of Exercise 7.5 is that to make symplectic coordinate changes 
of a Hamiltonian vector field it is sufficient to change the varables in the corre- 
sponding Hamiltonian function. This is a simpler operation, both conceptually 
and computationally. 

As we will see in the next Section, among the possible orbits of Hamiltonian 
systems, quasiperiodic orbits are of special interest. 
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Definition 7.6 A continuous function F : R — ► R is quasiperiodic if there exist 
n>2, f : T n -»■ R continuous and i/Gl"\ {0} such that F(t) = /(z^t, . . . v n t). 

Let M = [k G Z n | k ■ v — 0}. Note that .M is a Z-module. If dim .M = n then 
z/ = 0, if dim.M = n — 1 then there exists a£K and fceZ" such that v = ak. If 
dim.M = then v is called non-resonant. 

Exercise 7.7 Show that the closure of any orbit of the linear flow 9 = v on T n is 
diffeomorphic to the torus Y n ~ dimM . 

Exercise 7.8 Show that if dim.M G {1, . . .n — 1} there exists A e SL (n, Z) such 
that posing </? = A9 the linear flow 9 = v on T n becomes <^j = for i = 1, . . . , m 
and (fi = »[ for i = m + 1, . . . , n with {y' m+1 , . . . , i^J G IR n_m non-resonant. 

Exercise 7.9 Show that if v is non-resonant then the Haar measure on T n is 
uniquely ergodic (see [Mn] for its definition) for the linear flow 9 = v on T n . 

7.2 Integrable Systems 

An especially interesting example of symplectic manifold is M — M 1 x T ; which can 
be identified with the cotangent bundle of the /-dimensional torus T' = M'/(27rZ) ; . 
This manifold has a natural symplectic structure defined by the closed 2-form 
u = J2i=i dJi A d"&i where ( Ji, ...«/;, i?i, . . . i?f) are coordinates on M. 1 xT 1 . 

Definition 7.10 Let U denote an open connected subset ofM 1 . Whenever an 
Hamiltonian system can be reduced by a symplectic change of coordinates to a 
function H : U x T^ — *■ R which does not depend on the angular variables $ one 
says that the system is completely canonically integrable and the variables J are 
called action variables. 

Note that in this case Hamilton's equations (7.2) take the particularly simple form 
• dH • dH 

and the flow leaves invariant the /-dimensional torus J = constant. The motion 
is therefore bounded and quasiperiodic (or periodic). 

Being completely canonically integrable is a stronger requirement than inte- 
grability by quadratures or complete integrability (see [AKN] for their discussion). 
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In the latter case one requires the existence of / independent first integrals in invo- 
lution but their joint level-set may well be non compact (this is already the case 
in the two body problem for non negative energy values) and the flow does not 
need to be quasiperiodic (scattering states). 

The main risult in the theory of completely canonically integrable systems is 
the celebrated 

Theorem 7.11 (Arnol'd-Liouville)Lei H € C°°(M,R) and assume that 
F\, . . . , Fi G C°°(M, R) are I first integrals in involution for the Hamiltonian flow 
associated to H. Let a G R be such that M a = {m G M | Fi(m) = a^Vz = 
1, . . . , /} is not empty and assume that the I functions F±, . . . , Fi are independent 1 
in a neighborhood of M a . Then if M a is compact and connected it is diffeomor- 
phic to the l-torus. Moreover there exists an invariant open subset V of M which 
contains M a and is symplectically diffeomorphic to U x T , where U is an open 
subset ofR 1 . 

Arnol'd-Liouville's Theorem thus assures that the existence of sufficiently many 
first integrals together with the compactness and connectedness of their level set 
guarantees complete canonical integrability. 

7.3 Examples of completely canonically integrable systems 

In this section we will briefly describe some examples of completely canonical 
integrable systems. 

Example 7.12 : Harmonic oscillators. Let M = M 21 with the standard 
symplectic structure, S G GL (2/, R) be symmetric and positive definite. Consider 
the Hamiltonian system H(x) = ^x T Sx. This is completely integrable. Indeed 
if J is a symplectic matrix which diagonalizes S, in the variables y = J~ l x the 
Hamiltonian will be 

^ Xyt + Xi+wt+i 



H(y) 



i=i 



where A^ , i = 1, . . . , 2/ are the eigenvalues of S. Then the functions Fi = 
-^ — 2 I+t l+l , i = 1, . . . , /, are independent first integrals in involution and their 
common level set is compact and connected (since A^ > for all i). The symplectic 



1 As usual Fi, . . . , F[ are independent if dF\ A ... A dFi ^ 0. 
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transformation to action-angle variables is 



Vi = y2J iA /A i+z /A i cos Xi , Vi+i = y2J i A/A^%+z" sm Xi » *=1,...,Z. 

Example 7.13 : The two body problem. The Hamiltonian H : T*(IR 3 \{0}) i-» 

R of the two-body problem in the center of mass frame is (we have assumed G = 1, 
where G is the universal gravitational constant) 

2 ^ \\q\\ 

where fi = mom/ (mo + m) is the reduced mass of the system. 

It is well-known that for negative energy the solutions are ellipses with one 
focus at the origin (i. e. the center of mass). These are called keplerian orbits. The 
shape and the position of the ellipse in space are determined from the knowledge 
of the major semiaxis a, the eccentricity e, the angle of inclination i of its plane 
w.r.t. the horizontal plane qs = 0, the argument of perihelion uj and the longitude 
of the ascending node O. The position of the planet along the ellipse is determined 
by the mean anomaly /, which is proportional to the area swept by the position 
vector q of the planet starting from the perihelion. 

The systems admits 5 independent first integrals : the total energy H, the 
three components of the angular momentum q A p and one of the components of 
the Laplace-Runge-Lenz vector A = pAq Ap— ^nfy 2 - Among these integrals one 
can choose three integrals in involution and construct the completely canonical 
transformation to action-angle variables. The other two integrals are responsible 
for the proper complete degeneration of the Kepler problem : one can choose 
action-angle variables so that the Hamiltonian depends only on one of the actions. 
Indeed the Delaunay action-angle variables (L, G, O, /, g, 6) are related to the 
orbital elements as follows : 

L = n\J(mo + m)a , G = Ly 1 — e 2 , = Gcosi , /, g — u , 6 — 0, . 

Note that G is the modulus of angular momentum q A p, thus is its projection 
along the (/3-axis. One has the obvious limitation |0| < G. The new Hamiltonian 
reads H = -^ff^. 

The relation among Delaunay variables and the original momentum-position 
(p, q) variables is much more subtle and will not be discussed here. 
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The two-body problem is the modelization of the motion of a planet around 
the Sun. But the Delaunay variables are not suitable for the description of the 
orbits of the planets of the solar system since they are singular for circular orbits 
(e = 0, thus L = G anf the argument of the perihelion g is not defined) and for 
horizontal orbits (i = or i = re, thus G = © and the longitude of the ascending 
node 9 is not defined) . But all the planets of the solar system have almost circular 
orbits (with the exception of Mercury and Mars) and small inclinations. 

Poincare solved the problem first introducing a new set of action-angle 
variables (A, H , Z, A, h, () : A = L, H = L - G, Z = G - ©, A = / + g + 9, 
h = —g — 9, ( = —9 (A is called the mean longitude, —h is the longitude of 
the perihelion) then considering the couples (H, h) and (Z, £) as polar symplectic 
coordinates : 

£i = V2H cos h , r\\ = \/2Hsinh , £2 = v2^cosC , 772 = v2iTsin£ . 

The variables (A, £, A, rj) are called Poincare variables. They are well defined also 
in the case of circular (H = 0) or horizontal (Z = 0) orbits. 

Example 7.14 : Motion of a "heavy" particle on a surface of revolution. 

Let 5 C I 3 be a surface of revolution with the Riemannian metric induced by its 
embedding into M 3 and assume that X3 is its symmetry axis. Let / G C°°(R, R). 
If the surface never meets the xs axis then it is diffeomorphic to the cylinder 
S ~ IRxT 1 and its cotangent bundle will be 1R 3 xT 1 . If (pi,p2, qi, 92) are symplectic 
coordinates the Hamiltonian of a (heavy) point mass constrained to move on S is 
H(p,q) = ^lbl| 2 + f(qi)- Here / is the "weight" and P2 (which corresponds to 
the projection of the angular momentum of the particle along the a^-axis) is an 
independent integral of the motion. Complete integrability is assured if the curve 
{{Pii <7i) ^ R 2 I H(jpi, a, qi, 52) = E} is closed for some value of a and E. 
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8. Quasi-integrable Hamiltonian Systems 

The importance of completely canonically integrable Hamiltonian systems is due 
both to the fact that their flows can be studied in great detail and that many 
problems in mathematical physics can be considered as perturbations of integrable 
systems. The most famous example is given by the motion of the planets in the 
Solar System (see [Ma2] and references therein for an introduction). If the (weak) 
mutual attraction between the planets is neglected the system decouples into 
several independent Kepler problems and it is completely integrable. Exactly this 
problem gave origin in the 18th century to "perturbation theory" whose modern 
formulation is mainly due to the monumental work of Henri Poincare [P]. The 
goal of pertubation theory is to understand the dynamics of a "perturbed" system 
which is close to a well-understood one (usually an integrable system) . 

8.1 Quasi-integrable Systems 

Following Poincare [P], the fundamental problem of dynamics is the study of quasi- 
integrable Hamiltonian systems : let Eq > 0, 

Definition 8.1 A quasi-integrable Hamiltonian system is a function 7i G 

C°°((-£ ,£o) x M,R) such that the Hamiltonian function H = H(0, •) : M -»■ R 
is completely canonically integrable. 

Using the canonical transformation to action-angle variables associated to 7i(0, •), 
Hamiltonians TL : (— £o? £ o) x (7 x T 1 h R (smooth or analytic) of the form 

H(s,J,x) = ho(J) + sf(J,x), (8.1) 

where / G C°°(U x T , R), are typical examples of quasi-integrable Hamiltonian 
systems. 

The most ambitious program would be to prove that quasi-integrable Hamil- 
tonian systems are indeed integrable : i.e. to show that there exists a one- 
parameter family V s of open connected invariant subsets of M which are sym- 
plectically diffeomorphic to U e x T' where U e C M. 1 is open, connected and such 
that if (J, x) are the coordinates in U e x T' one has Ti(s, •, •) \v e — h £ (J) for some 
smooth one-parameter family of smooth function h £ : U e x 1^ — ► R. 

In general this is asking too much : a result of Poincare shows that in general 
quasi-integrable Hamiltonian systems are not completely integrable (in addition 
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to [P], Tome I, Chapitre V, see [BFGG] for a nice discussion of the consequences 
of this problem and a related result of Fermi). 

Theorem 8.2 (Poincare) Consider a quasi-integrable Hamiltonian of the form 
(8.1), / > 2. Assume that the two following genericity assumptions are satisfied : 
(1) non-degeneracy : det ( qj.qj ) ^ on U ; (2) generic perturbations : for all 
J G U and for all k G Z l \ {0} either the k-th Fourier coefficient fk(J) of f does 
not vanish or there exists k' G ll \{0} parallel to k such that fk'(J) 7^ 0. Then the 
system is not a smooth one-parameter family of completely canonically integrable 
Hamiltonians. 

One can also recall the following theorem of Markus and Meyer [MM] 

Theorem 8.3 Generically hamiltonian systems are neither completely canoni- 
cally integrable nor ergodic 

Exercise 8.4 Prove Poincare's Theorem following these lines. Using the notations 
introduced above, if the system were completely canonically integrable then the 
new actions J would be a system of I independent first integrals of the Hamiltonian 
flow of 7i in involution. Writing them explicitly in terms of the old local coordinates 
(J, x) ° ne has 

J = J + sJ 1 (J, X ) + 0(e 2 ), (8.2) 

for some smooth function J\ : V £ — ► U s . Imposing that {J, H} = 0(e 2 ) leads to 
the system of linear partial differential equations 

^^ dh dJ-ij df . 

^ai-ft^r^' ' = 1 --'- (8 ' 3) 

Using Fourier series try to find a smooth solution to these equations .... 



8.2 Constant Coefficients Linear PDE on T n and Loss of Differentia- 
bility. 

The (very) short sketch of the proof of Poincare's Theorem led us to consider the 
general constant coefficients linear partial differential equation on T n 

D^u :— fj,- du — v , (8.4) 
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where \i G M n , du = {d\u, . . . , d n u), is the gradient of u, v G C 0,oo (T n , IR m ) (i.e. 
v G C°°(T n ,R m ) and J Jn v(0)de = 0). Indeed for all fixed value of J the equation 
(8.3) is a special case of (8.4) with n — m = I. 

It is easy to check (see Appendix A3 for a detailed discussion of the case 
n = 2) that D^ is hypoelliptic 1 if and only if \x is a diophantine vector, i.e. there 
exist two constants 7 > and r > n — 1 such that 

\fi-k\ > 7 |A;r T VA; G Z n \ {0} , (8.5) 

where k = (k\, . . . k n ), \k\ = \ki\ + . . . + \k n \. 

Exercise 8.5 Prove that almost all jjl G W 1 is diophantine of exponent r > n — 1. 

Exercise 8.6 Have a look to the book of Y. Meyer [Me]. Among many interesting 
things one finds the following theorem (Proposition 2, p. 16) : Let 1Z be a 
real algebraic number field and let n be its degree over Q. Let a be the Q- 
isomorphism of 1Z such that cr(TZ) C K and let fii, . . .\x n be any basis of 1Z over 
Q. Then (a(/Ui), . . . , a(fi n )) G M n is diophantine of exponent r — n — 1. Try to 
prove it if you remember a tiny bit of Galois theory. Apply it to (1, \/2, v^, \/G) 
and (1, 2 1 ' 3 , 2 2 ' 3 ). There exist also higher dimensional generalizations of Roth's 
Theorem quoted in Exercise 4.9 : see, for example, the Subspace Theorem [Schl, 
Sch2]. 

In addition to knowing that u G C°°(T n ,M m ) one has the following more precise 
estimate : 

Proposition 8.7 Let || ||fc denote theC k norm. If /i is diophantine with exponent 
t then for all r > r + n — 1 and for all i G N there exists a positive constant Ai 
such that 

\\u\\i < Ai\\v\\i +r . (8.6) 

Proof. Let u{6) = Xlfcez™ Uke 27Tlk ' e , where obvoiusly one has 

u k = f u (6)e- 27Tlk - e d9 . 



1 A constant coefficients linear partial differential operator P is hypoelliptic if 
all u such that Pu = v are C°° on all open sets where v is C°° (see [HI], p. 109). 
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Then the C fc -norm can be equivalently given in terms of Fourier coefficients : for 
all z G N there exists a positive constant Bi such that 

B' 1 sup [(1 + \k\Y\u k \] < \\u\\i < B % sup [(1 + \k\Y +n+1 \u k \] • (8.7) 

Comparing the Fourier coefficients of u with those of v one has 

The desired estimates are an easy consequence of (8.7), the assumption that [i is 
diophantine and of the elementary fact ^ fceZ ™\ r \ \k\~ S < +oo for all 5 > n. □ 

The fact that one needs r more derivatives to bound the norms of u in terms 
of those of v is what is called the "loss of differentiability" . As we have already 
seen in Chapter 6 this is a typical phenomenon associated to small divisors. The 
analogue in the analytic case would be the necessary restriction of the domain to 
control the maximum norm of u in terms of v by means of Cauchy's estimates as 
we did in Section 6.3. 

In both cases (smooth and analytic) these are not artefacts of the methods 
used but a concrete manifestation of the unboundedness of the linear operator 
D~ . The main consequence of this fact is that one cannot use Banach spaces 
techniques to study semilinear equations like D^u = v + sf(u), where e is some 
small parameter. These semilinear equations are however typical of perturbation 
theory. 



8.3 KAM Theory, Nekhoroshev Theorem, Arnol'd Diffusion 

Despite Theorem 8.2, most results on quasi-integrable systems have been obtained 
under the assumption of non-degeneracy (i.e. the hessian matrix of Hq is non 
degenerate thus the frequency map J i— ► i>q(J) = ^ff-(J) 6 1 ! is a local 
diffeomorphism) but accepting the fact that one cannot hope for integrability on 
open sets. 

The general picture is provided by KAM [Arf ,Ga, Bo, Yol] and Nekhoroshev 
[Ne, Lo] theorems : if e is sufficiently small, most initial conditions (w.r.t. Lebesgue 
measure) lie on invariant /-dimensional lagrangian tori carrying quasiperiodic 
motions with Diophantine frequencies. The action variables corresponding to these 
orbits will remain e-close to their initial values for all times. The complement 
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of this set is open and dense and it is connected if / > 3. It contains a 
connected (/ > 3) web 1Z of resonant zones corresponding to Z^-linearly dependent 
frequencies : U fce ^{J G U ,vq(J) • k = 0} x T\ Motion along these resonances 
cannot be excluded (see [Ar2] for an explicit example), resulting in a variation of 
(9(1) of the actions in a finite time 2 . But if the hamiltonian is analytic and ho 
is steep (for example convex or quasi convex) then this variation is very slow : it 
takes a time at least O (exp (-^)) to change the actions of 0(s b ), where a and 
b are two positive constants. Moreover each invariant torus has a neighborhood 
filled in with trajectiories which remain close to it for an even longer time. Indeed, 
if ho is quasi-convex one can prove [GM] that all trajectories starting at a distance 
of order p < p* from a Diophantine i-torus of exponent r will remain close to it 

for a time O I exp I exp I — J J J . 

One of the consequences of KAM theorem [Po] is the existence, for sufficiently 
small values of s, of a Cantor set N e of values of the frequencies v for which the 
Hamiltonian system (8.1) has smooth invariant tori with linear flow. Moreover 
there exists a homeomorphism F e : N e x T l — ► U x T' e-close to the identity, 
Whitney smooth w.r.t. the first factor and analytic w.r.t. the second (if the 
Hamiltonian (8.1) is analytic) which transforms Hamilton's equations into v = 0, 
ip — v. This foliation into invariant tori is thus parametrized over a Cantor set and 
hence nowhere dense. It exhibits the phenomenon of "anisotropic differentiability" 
since it is much more regular tangentially to these tori than transversally to them 
(see also [BHS]). 



2 It is conjectured [AKN, p. 189] that generically quasi-integrable hamiltonians 
with more than two degrees of freedom are topologically unstable 
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9. The Inverse Function Theorem of Nash and Moser 

The Inverse Function Theorem for Banach spaces is one of the extremely useful 
standard tools in the study of a variety of non-linear problems, ranging from the 
good position of the Cauchy problem for ordinary differential equations to non- 
linear elliptic equations. Unfortunately the "loss of differentiability" typical of 
small divisors problems prevents from its use (with some remarkable exceptions 
however, see Section 6.2 and [He2]). In the analytic case, Kolmogorov suggested 
the use of a modified Newton method to overcome this difficulty but in the 
differentiable case the need of an Inverse Function Theorem in Frechet spaces 
has also other sources : its origin is the solution of the embedding problem 
for Riemannian manifolds by Nash [N]. Later Moser discovered how to adapt 
Kolmogorov's idea to the differentiable case creating a theory with a wide spectrum 
of applications [AG, Gr, H2, Ha, Ni, Ser, St, SZ, Zel] : to geometry, to the study 
of foliations and deformations of complex and CR structures, to free boundary 
problems, etc.. In all these cases a non-linear partial differential equation is 
solved using a rapidly convergent iterative algorithm introducing at each step 
of the iteration a smoothing of the approximate solution. 

In this Chapter we will follow the presentation of [Ha] very closely. 

9.1 Calculus in Frechet Spaces 

Definition 9.1 A Frechet space is a locally convex topological vector space (lctvs) 
which is complete, Hausdorff and metrizable. 

Exercise 9.2 Show that a lctvs X is Hausdorff if and only if x E X , \\x\\i = Vi E 
X then x = (where (|| • ||i)igx is the collection of seminorms giving the topology 
of X). Show that X is metrizable if and only if X is countable. 

Exercise 9.3 Show that 1R°° (space of all sequences of real numbers), C°°(M) 
(where M is a smooth compact manifold), ./4(C) (entire functions) are Frechet 
spaces (thus the exercise asks you to define suitable seminorms). Show that Co(M.) 
(continuous functions with compact support) with the usual topology (f n — ► / 
if and only if there exists a compact interval I such that supp f n C I for all 
sufficiently large n, supp / C I and f n converges uniformly to / on I) is a lctvs 
but it is not a Frechet space since it is not metrizable. 
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Exercise 9.4 Prove that Hahn-Banach Theorem holds in Frechet spaces : if X 
is a Frechet space and x is a non-zero vector in X then there exists a continuous 
linear functional / : X — *■ R(orC) such that l(x) = 1. This allows to introduce 
quite straightforwardly X-valued analytic functions [Va]. A function x : O — ► X, 
where O is a region in C, is analytic if and only if for all / G X* the function / o x 
is analytic. Show that this is equivalent to asking that, for all zq G O, x has a 
convergent power series expansion at zq : x(z) = Y^=o( z ~ z o) Ux n- 

Exercise 9.5 Extend the theory of Riemann's integration, including the funda- 
mental theorem of calculus, to continuous X -valued functions on [a, b] C R. 

Definition 9.6 Let X, Y be two Frechet spaces, U C X be open, f : U — > Y be 
continuous. The derivative of f at x G U in the direction of h G X is 

Df(x).h:=lun f{X + t ^- f{x) . (9.1) 

f is C 1 on U if and only if Df exists for all x G U and for all h G X and 
Df : U x X — > Y is continuous. 

Remark 9.7 In the case of Banach spaces this definition of C 1 is weaker than the 
usual one. 

Exercise 9.8 Prove that the composition of C 1 maps is C 1 and that the chain rule 
holds : D(g of)(x)-h = Dg(f(x)) ■ (Df(x) ■ h). 

Exercise 9.9 Define higher order derivatives and C k maps between Frechet spaces. 

Exercise 9.10 Let / : C°°([a,b]) -»■ C°°{[a,b}), f(x) = P(x,x', . . . , x^), where 
P G K[Xo, . . . X n ], is C°°. Is there a nice formula for Df(x) ■ h ? [Hint : start from 
monomials like (x^ l ') k .] 

The following examples show why the extension of the inverse function theorem 
to Frechet spaces is not a straightforward generalization of the inverse function 
theorem in Banach spaces but needs some extra assumption. 

The map x ^ f(x) = sinir, where x <E X = L 2 ([0, 1]), is of class C 1 according 
to Definition 9.6. Its derivative Df(Q) =identity but / is not invertible : /(0) = 
and the functions x n (£) = 7TX[o,i/n](0) where X[o,i/n] denotes the characteristic 
function of the interval [0, 1/n], converge to x = but f(x n ) = for all n. 
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Another example is obtained taking X = C°°([— 1, 1]) and considering the map 
/ : X -> X defined as /(a;)(f) = x(f) - fx(Oa'(f) for all £ e [-1, 1]. Then it is 
immediate to check that / is smooth and Df(x)-h = h~^x'h — ^xh' ', thus /(0) = 
and Df(0) =identity. But / is not invertible : the sequence x n (^) = — + ^- — > in 
C°°([— 1, 1]) asn-^ +oo but one can check that it does not belong to /(C°°([— 1, 1])) 
for all n > 1. [Hint : use the fact that if x E C°°([— 1, 1]) one can take its Taylor 
series at at any finite order and apply /. ] 

An even more interesting counterexample (see [Ha] for details) is the follow- 
ing : let M be a compact manifold, X = C°°(M,TM) be the Frechet space of 
smooth vector fields on M, Diff°°(M) be the group of smooth diffeomorphisms 
of M (it is a Frechet manifold, it's not very hard to figure out what this means, 
otherwise look in [Ha]). Then the usual exponential map 

exp : C°°(M,TM) -► Diff°°(M) 
v I— > exp(i> ) 

clearly verifies exp(O) = idM and .Dexp(O) =identity, but the exponential map is 
not invertible in general. Note that this would have meant that any diffeomorphism 
extends to a one parameter flow. 

For example a diffeomorphism of S 1 without fixed points is the exponential 
of a vector field only if it is conjugate to a rotation. But there exist [Yo3] 
diffeomorphisms of S 1 arbitrarily close to the identity which are not conjugate 
to a rotation. 

What goes wrong in all these examples is that although the derivative of the 
map is the identity at the origin it fails to be invertible at nearby points. Indeed in 
the second example above one has Df(l/n) -£ fc = (l — ^) £ fc , thus Df{l/n)^ n = 0. 

Thus one has to require the invertibility of Df on a neighborhood explicitly 
and this is usually difficult to be checked. In a Banach space (with the usual 
definition of derivative of a map instead of Definition 9.6) this is not needed. 

9.2 Tame Maps and Tame Spaces 

Definition 9.11 A graded Frechet space X is a Frechet space with a collection 
of seminorms (|| ||) n eN which define the topology and are increasing in strength 

\\ x \\o < IMIi < IMI2 < • • • VxeX. 



60 



Definition 9.12 Let X, Y be graded Frechet spaces, U C X open, P : U — > Y be 
a continuous map. P is tame if for all xq E U there exists a neighborhood V C U 
of xq and a non negative integer r such that for all i E N there exists Ci > such 
that 

\\P(x)\\i < Ci(l + ||x|| i+r ) Vx E V . (9.2) 

A C k tame map is a C k map P such that D^P is tame for all < j < k. 

The most typical example of a tame operator between Frechet spaces is given by 
nonlinear partial differential operators on compact manifolds. If P : C°°(M) — > 
C°°(M) is a smooth function of x E C°°(M) and its partial derivatives of degree 
at most r then we say that the degree of P is r and this will be the "loss 
of differentiability" in (9.2). The proof of this fact is given in [Ha] and uses 
Hadamard's inequalities for functions x E C°°(M) : for all n E N and for all 
integer k such that < k < n there exists Ck, n > such that 

\\x\\ k < C k ^\\x\\ k n /n \\x\\l~ k/n Vx E C°°(M) and . (9.3) 

Exercise 9.13 Prove that the composition of two C k tame maps is a C k tame 
map. 

Definition 9.14 A graded Frechet space X is tame if it admits smoothing 
operators, i.e. a one-parameter family S(t) : X — ► X, t E [1, +oo), of continuous 
linear operators such that there exists a non negative integer r and positive real 
constants (C n ,fc) n ,£;eN such that for all x E X and for all t E [1, +oo) and for all 
k E {0, 1, . . . , n} one has 

\\S(t)x\\ n <C n , k t n ~ k \\x\\ k , N 

k-n ( 9 - 4 ) 

\\X (3 (tJCC || fc < (-^k,nt ll^lln 



Exercise 9.15 (convolution with regularizing kernels) Let ip E C^(M n ) 

and assume that ip > 0, ift = 1 near 0. Let ip be the Fourier transform of ij) : 
fit) = J^^^e-^^dn. Let t > 1, <p t (0 = t n if(ti). Define S(t)f = <p t *f, 
where / E C°°(T n ). Show that (S(t))t>i is a family of smoothing operators on 

C°°(T n ). 

Exercise 9.16 Prove that in a tame Frechet space Hadamard's inequalities hold : 

\\x\\i < C(k,n)\\x\\l~ a \\x\\^ Vk<l<n , I = (1 - a)k + an . 
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[Hint : use (9.4) with t = ||a:||n /(n " fc) ||x||- 1/(n - fc) .] 

9.3 The Nash-Moser Theorem 

We can finally state Nash-Moser's [N,M] implicit and inverse function theorems. 



Theorem 9.17 (implicit function) Let X, Y, Z be three tame Frechet spaces, 
U C X x Y open, $ : U — ► Z a tame C r map, 2 < r < oo. Let (xo, yo) G U . 
Assume that there exists a neighborhood Vq of (xq, yo) and a continuous z-linear 
tame map L : Vq x Z — ► Y, ((x, y), z) \— ► L(x, y) ■ z, such that if (x, y) G Vq then 
D y $>(x,y) is invertible with inverse L(x,y). Then xq has a neighborhood W on 
which ^ G C r (W, Y) is defined and such that ^{xq) = yo and for all x G W one 
has (x, \P(x)) G U and $(x, *(x)) = $(x , yo)- 



Theorem 9.18 (inverse function) Let X,Y be two tame Frechet spaces, 
U C X open, $ : U — ► Y a tame C r map, 2 < r < oo. Let a;o E U, yo — 3>(xo). 
Assume that there exists a neighborhood Vq of xq and a continuous y-linear tame 
map L : Vq x Y — > X, (x, y) \— > L(x)-y, such that ifx G Vb then D$(a;) is invertible 
with inverse L(x). Then xq has a neighborhood V C Vq and yo has a neigborhood 
W such that $ : V — > W is a tame C r diffeomorphism. 



Exercise 9.19 Show that the two previous theorems are equivalent. 

We refer the reader to [Ha] for the proofs of Theorems 9.17 and 9.18. The main 
idea of the proof is to use a modified Newton's method for finding the root of the 
equation 3>(x) = y. It makes use of the smoothing operators S(t) to guarantee 
convergence. Here we will content ourselves with a brief sketchy description of the 
argument. 

Without loss of generality we can assume xq = yo — 0. An algorithm for 
constructing a sequence Xj G X which will converge to a solution x of $(x) = y (for 
small enough y) is the following : fix a sequence tj = e^ 3 / 2 ^, so that tj+i = t-' , 
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and let 

xq = (initial guess ) , 

Xj = . . . O'-th guess ) , 

Zj = y — $(£7) 0' _ th error ) , 

Axj — S(tj)L(xj)zj O'-th correction) , 

Xj+i = Xj + Axj (j + 1-th guess ) . 

The idea to show convergence of this algorithm is the following : let 

R(x,h)= / D 2 $(x + th)(h,h)dt 
Jo 

denote the quadratic integral remainder in Taylor's formula. Since 

$(x + h) = $(x) + D$(x) ■ h + R(x, h) 
one gets 

zj+i = y- ®(xj+i) =y-®(xj + &Xj) 

= Zj — D3>(xj)S(tj)L(xj)zj — R(xj,Axj) . 
Using the identity Zj = D&(xj)L(xj)zj we find 



z 



+ i = D$(xj)[I - S(tj)]L(xj)zj + R{xj, Axj) . 



The first term tends to zero very rapidly since S(tj) — >■ / as j — >• +00 and the 
second term is quadratic. 

This short description of the idea of the proof makes also clear why one needs 
the assumption $ at least of class C 2 (in Banach spaces C 1 is enough). 
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10. From Nash-Moser's Theorem to KAM : Normal 
Form of Vector Fields on the Torus 

Following Herman we will prove in this Chapter a normal form theorem for vector 
fields on the torus which can be considered as the basic KAM theorem in higher 
dimension (without taking the symplectic structure into account) . The proof will 
be an application of Nash-Moser's Theorem. For a proof of KAM theorem see, 
for example, [Bo]. 

Let Diff°° (T , 0) denote the group of C°° diffeomorphisms / of the torus T' 
homotopic to the identity and such that /(0) = 0. This space can be identified to 
an open subset of the tame Frechet space C°°(T l ,R l ,0) = {u G C°°(T Z ,M Z ) , u(0) = 
0} : u corresponds to a diffeomorphism / if and only if for all x G T' one has 
id + du(x) G GL (/, R). In this case one has / = idji + u. 

Since the tangent bundle of T l is canonically isomorphic to T' x R one can 
also identify the space of C°° vector fields on the torus with C°°(T l , M. 1 ). 

Let \i G M} be Diophantine with exponent r and constant 7. We will denote 
i? M the translation by \i on the torus T l : R^,(xi, • • • , Xi) — (Xi + Mi, • • • , Xi + Hi)- 

Exercise 10.1 Show that the map 

/ : Diff°°(T', 0) -► Diff°°(T z , 0) 
f^I(f) = f- 1 

is a tame C°° map. Its derivative is 

DI(f)-h = -[(df)- 1 -h]of- 1 . (10.1) 

The following statements (and proof) are taken from ([Bo], pp. 139-141) and 

[He5]. 

Theorem 10.2Let fx eR l . The map 

d^ :Difr(T l ,0)xR l ^Difr(T 1 ), 

(f,v)^R„ofoR^of- 1 , 



(10.2) 



is a tame C°° map. Moreover, if /i is a diophantine 1 vector then $ M is a tame C c 
local diffeomorphism near f = idji, v = 0. 



1 In this situation \i is diophantine if there exist two constants 7 > and r > I 
such that \n ■ k + p\ > r y\k\~ T for all fceZ'\ {0} and for all p G TL. 
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The meaning of the second part is that when ii is diophantine the diffeomorphisms 



of the torus T l conjugate to the translation R^ by a diffeomorphism close to 
the identity form a Frechet submanifold of codimension / of Diff°°(T z ) which is 
transverse in li to the space R l of the translations on the torus. 

Exercise 10.3 Guess the statement of for vector fields equivalent to Theorem 
10.2. 

Here is the solution : 

Theorem 10.3Let 11 G R l . The map 



^ M : Dijf°(T l ,0) xR l ^C°°(T l ,R l ) , 

(/, v) >-> v + f*n = v + df o f- 1 ■ n 



(10.3) 



is a tame C°° map. Moreover, if ii is a diophantine vector (see (8.5) ) then \1/ M is 
a tame C°° local diffeomorphism near f = idji, v = 0. 

Proof. First of all note that ^(idx*, 0) = \i. The first assertion is an immediate 
consequence of Exercises 9.13 and 10.1. Moreover, using (10.1), one easily checks 
that 

£>*„(/, v) ■ (A/, Av) = Av + (dAf) of- 1 , ft 

+ d 2 f o r 1 ■ (-(df)- 1 o r * • Af o r\ p) 

= Av+[(dAf).Li 

+ d 2 f-(-(df)- 1 -Af^)]of- 1 



(10.4) 



(we recall that here one has Af e C°°(T l , R l , 0), Av E R l ). 
If one introduces u, writing Af = df ■ u then one gets 

(dAf)o f- 1 . n= [&f -u. fi + df -du. n]o f- 1 

and 

o 2 f o r 1 ■ (-(df)- 1 of- i -Afof-\ fi ) = -[d 2 f ■ u ■ ix] o r 1 . 

Therefore (10.4) simplifies considerably and becomes 

D*M v) ■ (df ■ u, Av) = Av + (df ■ du ■ fi) o f- 1 . (10.5) 
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To prove the second assertion we will apply Theorem 9.18 to $ = \P M at the points 
xq = (id T ; , 0) and yo = [i. We must just check that _D\I/ M (/, v) is invertible for all 
(/, v) in a neighborhood of (id T (, 0). This leads us to the equation 

Av + (Of ■ du ■ fi) o f- 1 = w . (10.6) 

Composing on the right with / and multiplying both sides by (<9/) _1 one gets 

V-du = (df)- 1 -[wof- Av] , (10.7) 

i.e. an equation of the form (8.4) with v = (<9/) _1 -[wo f — Av\. This clarifies why 
one needs the term v in the definition (10.3) of \& M : indeed one fixes it so as to 
assure that v G C 0,oo (T l , M. 1 ), i.e. it has zero average on the torus T . One can also 
check that the map (/, w) i— ► Av is tame. 

Proposition 8.7 allows to conclude since it shows that the map (/, v) i— >■ u = 
D~ x v is tame. D 
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Appendices 



Al. Uniformization, Distorsion and Quasi-conformal maps 

In this appendix we recall some elementary and less elementary facts from 
the theory of conformal and quasi-conformal maps of one complex variable. 

Al.l A nonempty connected open set is called a region. 

Theorem Al.l (The Maximum Principle) If f(z) is analytic and non- 
constant in a region Q of the complex plane C, then its absolute value \f(z)\ 
has no maximum in Q. 

Proof. It is an easy consequence of the fact that non-constant analytic functions 
map open sets onto open sets. □ 

The maximum principle implies that if / is defined and continuous on a compact 
set K and analytic in the interior of K then the maximum of |/(^)| on K is 
assumed on the boundary of K. Another easy consequence is the following 

Exercise A1.2 (Schwarz's Lemma, automorphisms of the disk) Schwarz's 

Lemma : If \f(z)\ is analytic for \z\ < 1 and satisfies the conditions \f(z)\ < 1, 
/(0) = 0, then \f(z)\ < \z\ and |/'(0)| < 1. If \f(z)\ = \z\ for some z # or if 
|/' (0)| = 1 then f{z) = cz with c € C, |c| = 1. Automorphisms of the disk : Show 
that if / : D — ► D is an automorphism of the disk and /(0) = then |/'(0)| = 1 
and / is a rotation. Deduce from this that the group of automorphisms of the unit 
disk D is 

Aut (D) = {z .-> T(z) = ^±1 , a , b e C , \a\ 2 - \b\ 2 = 1}] . 

bz + a 



A 1.2 A mapping / of a region O into C is called conformal if it is holomorphic and 
injective. Such maps are also called univalent. Since an analytic map is injective 
if and only if /' ^ if / is univalent in Q its derivative never vanishes, i.e. it has 
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no critical points inside O. The most important result of the theory of conformal 
maps is certainly the 

Theorem A1.3 (Riemann Mapping Theorem) Given any simply connected 
region Q which is not the whole plane, and a point zq G O there exists a unique 
conformal map (the Riemann map) f : D — ► O such that f is onto, /(0) = zq and 
/'(0) > 0. 

Exercise A1.4 Drop the requirement /'(0) > 0. Then / is not unique but the 
number |/'(0)| does not depend on /. It is called [Ahl] the conformal capacity of 
Q with respect to zq and it will be denoted C(0, zq). [Hint : use Exercise Al.3] 

One should not think to an arbitrary simply connected region as the "potato" of 
PDEs but as a rather irregular object. For example the boundary needs not to be 
locally connected. 

Al.3 Assume that the region O is bounded and dVt is a closed Jordan curve (i.e. 
<9Q = 7([0, 1]), where 7 : [0, 1] — ► C is continuous, 7(0) = 7(1) and 7(^1) = 7(^2) 
if and only if t\ = £2 or t\ — 0, £2 = 1)- In this case the Riemann map / : D — ► O 
has a nice boundary behaviour : 

Theorem A1.5 (Caratheodory) A Riemann map f : D — »■ O extends to a 
homeomorphism of D onto Q if and only if dfl is a closed Jordan curve. 

The topic of the boundary behaviour of conformal maps is very rich and it's an 
active research area : we refer to [Po] for more informations and references. We will 
only need two other results : the first extends Caratheodory's theorem dropping 
the assumption that the restriction of / to the boundary of the disk is injective. 

Theorem A1.6 Let f : D — > O be a Riemann map. The following four conditions 
are equivalent : 

(i) f has a continuous extension to D ; 

(ii) <9fi is a continuous curve, i.e. <9fi = {<£>(() ,( E T} with <p continuous ; 
(in) 90 is locally connected ; 
(iv) C \ O is locally connected. 
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Our second result, due to Fatou, applies to all / : D — > C holomorphic and bounded 
(thus we are dropping the assumption of / being injective in D). 

Theorem A1.7 (Fatou) Let / : D — ► C be holomorphic and bounded. Then f 
has a non-tangential limit at almost all points (G T = <9B. Moreover if f is not 
identically zero then y?(C) = lim r ^i_ f(rQ (which belongs to L°°(T)) is not zero 
almost everywhere. 



A 1.4 Another fundamental result is the celebrated 

Theorem A1.8 (Uniformization Theorem) The only simply connected Rie- 
mann surfaces, up to biholomorphic equivalence, are the Riemann sphere C = 
C U {oo}, the complex plane C and the unit disk D. 

Exercise A 1.9 Prove that the group of automorphisms of the Riemann sphere is 
the group PGL (2, C) acting by homographies : if g — I , J G PGL (2, C) then 

z i— ► q ■ z = — 
affine group. 



z i— ► g • z — a c z z Vd - ^ ne g rou P °f automorphisms of the complex plane is simply the 



A1.5 One can consider univalent functions / on regions of C with values in C : 
in this case / must be meromorphic and injective. Here are some elementary 
properties : 

Exercise ALIO 

(a) If / is univalent on a region OcC then / is analytic except for at most a 
single simple pole and /' never vanishes. 

(b) If / : O — > Q! is onto and univalent then f~ l : O' — > O is also univalent. 

(c) A univalent map is a homeomorphism. 

(d) A univalent map preserves angles between curves and their orientation (that's 
why they're called conformal !). 

(e) The composition of univalent maps is univalent ; / is univalent if and only if 
1/ f is univalent. 

Exercise A 1.11 Prove that if / : O — ► C is univalent and A C O is measurable 
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then 



Area(/(A)) = f \f'(x + iy)\ 2 dxdy . 

J A 



Exercise A1.12 (The Area Formula) Show that if / : D — ► C is univalent, 
letting f(z) = T,n=o /^ n > one has 



Area (/(D)) = ttJ>|/ 



2 



n=l 



[Hint : First consider the disk D r of radius r < 1. If / = u + iv then 
Area (/(D)) = J dBr udv = \ J dBr fdj. Then let r - 1-.] 

Let Si denote the collection of functions / univalent in D and such that /(0) = 0, 
/'(0) = 1, thus f(z) = z + J2Z 2 + — With Ei we will denote all functions 
g (Q = ( + 9o + g^- 1 + ... univalent in the outer disk E = {C G C, |C| > !}• 
Clearly if / E S\ then g(£) = 1//(C _1 ) belongs to Si and omits 0. Conversely, 
if g G Ei and g(() ^ for all C G E then f(z) = l/g(z~ l ) belongs to S\. One 
of the most important results on univalent functions is the object of the following 
exercise : 

Exercise A1.13 (Area Theorem) If g G Si then \g^\ < J^^Li n\g n \ 2 < 1. Prove 
that equality holds if and only if g(() = C + fi'o+fi'iC -1 with |<7i| = 1. [Hint : show 
that area(C \ g(E)) = n(l - £"=i n\g n \ 2 ).} 

One of the main consequences is the following apparently innocent bound : if 
f E Si then 

l/ 2 |<2, (Al.l) 



as one can easily check applying the Area Theorem to g(Q = a//(C -2 )- However 
this estimate will have many important consequences as we will see soon. A (much 
harder and for a long time conjectural) result is the celebrated [DeB] 

Theorem A1.14 (Bieberbach— De Branges) If f E Si then \f n \ < n for all 
n. 

Using (Al.l) one can easily show that the image of D through a univalent map 
cannot be too small : 
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Theorem A1.15 (Koebe 1/4-Theorem) If f € Si then D 1/4 C /( 



Proof. Let ro£D and assume w ^ /(D). Then 

wf(z) 



/(*) 



w - f(z) 



z+(f 2 + w- L )z z + ... 



belongs to 5"i . Applying (Al.l) to both / and / one gets \w\ 1 < I/2I + I/2+W 1 | < 
4. D 

The Koebe function f(z) = z(l — z)~ 2 = ^Z n -iTiz n ma P s the unit disk D 
conformally onto C \ (—00,— 1/4). Therefore Bieberbach-De Branges' Theorem 
and Koebe 1/4-Theorem are optimal. If / is univalent and analytic in D, given 
any zq G D the Koebe transform of / at zq 



K ZoJ (z) 



'( 



Z + ZQ 

1+202; 



- f(zo) 



(i-\z \rr(z ) 
(i-\z \) 2 nz ) _ 



(A1.2) 



z + 



2/'(*o) 



Zq 



z z + ... 



belongs to Si. This is a very useful tool in order to transfer the information at 
to information at any point of the disk. Applying systematically this idea, from 
(Al.l) one deduces the following important distorsion estimates : 

Exercise A1.16 (Koebe distortion theorems) If / maps D conformally into 

C then Vz G D one has : 



,2>/"M 



(1-W^ -» 



<4 



l/'(0)| 



::i <|/(*)-/(o)|<|/'(o)| ::; 



(i + M) 2 
If (0)1 1_u 



(i+\ z \r 



<\f'(z)\<\f'(0)\ 



1 



(1-kl) 2 
l + \z\ 

(i-\z\r 

2\l fit 



(1 - \z\')\f{z)\ < dist(/(z),a/(D)) < (1 - \zn\f'(z)\ . 



(A1.3) 
(A1A) 
(A1.5) 
(A1.6) 



These estimates show that the growth of / as z approaches <9B cannot be faster 
than (1 — r) -2 , where r = \z\. The next theorem (see [Po]) for a proof) shows that 
the average growth is much lower than (1 — r) -2 . 
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Theorem A1.17 Let f map D conformally into C. Then /(£) = lim r ^i f(rQ 7^ 
00 exists for almost all C G T = <9B and for < r < 1 one has 



»2ir 



i- J " \f(re ie - f(0)\ 2 / 5 dt < 5|/'(0)| 2 / 5 . (Al.7) 



A1.6 We conclude our brief introduction to univalent functions with the proof of 
a fundamental property of Si. In order to do this we recall ([Re], p. 163) 

Lemma A1.18 (Hurwitz) If a sequence (/ n )neN of functions holomorphic in 
a region OcC converges uniformly on compact subsets of Q to a non-constant 
holomorphic function f : — »■ C then the following statements hold : 

(a) if all the images / n (0) are contained in a fixed set A then /(0) C A ; 

(b) if all the maps f n : — *■ C are injective then so is f : — *■ C ; 

('cj if all the maps f n : O — >• C are locally biholomorphic, then so is f : fi, — ► C. 

This is the ingredient we missed for the proof of the following 

Theorem A1.19 Si endowed with the topology of uniform convergence on 
compact subsets ofD is a compact topological space. 

Proof. Any sequence (/ n )neN C Si is equicontinuous and uniformly bounded on 
compact subsets of D by Koebe distortion theorems. Limit functions are in Si 
because they are univalent by Hurwitz's lemma and the normalisation |/^(0)| for 

all neN. □ 

Exercise A1.20 Prove that Theorem Al.19 is equivalent to the following (see 
[Mc]) : the space of all univalent maps / : D — > C is compact up to post- 
composition with automorphisms of C. This precisely means that any sequence of 
univalent maps contains a subsequence f n : D — ► C such that M n o f n converges 
to a univalent map /, uniformly on compact subsets of D, for some sequence of 
Mobius transforms M n E PGL (2, C). 

Al.7 Let / : C — > C be a C 1 orientation-preserving diffeomorphism. Then given 
any point zoGC one has 

f(z) = f(z ) + f z (z )(z - z ) + f- z (z )(z -z )+o (\z -zq\), (A1.8) 
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where 

If Of .df\ 1/0/ df\ 

Note that if / is analytic in zq then fz(zo) = (Cauchy-Riemann). The Jacobian 
determinant of / is J = \f z \ 2 — \fz\ 2 - Since / is orientation-preserving one has 
J>0, thus \f z \ > \f z \. 

Definition A1.21 The dilatation of f in zq is 

n / x \fz(zo)\ + \fz(zo)\ 

D f( z o) ■= I , 7 X, 1 , 7 x, > 1 • (ALIO) 

|/*Oo)|-|/?Oo)| 

Note that if / is conformal then £)y = 1. 

Here is a geometric interpretation of the meaning of the dilatation : the 
differential df(zo) maps a circle in the tangent space T Zo C into an ellipse in 
Ty(z )C. The dilatation measures the distorsion since it is the ratio of the major 
semiaxis amd the minor semiaxis. Indeed applying (Al.8) to an infinitesimal circle 
Az = ee %e centered at zq one finds an infinitesimal ellipse centered at f(zo) with 
major semiaxis [\f z \ — |/z|] _1 £ and minor semiaxis [\f z \ + \f z \]~ 1 e. 

The maximal dilatation of / on C is Df = sup zeC Df(z). If Df < +oo, let 
Kf = (D f - !)/(£>/ + 1). Then one has -4 s - < «/ < 1. 



Definition A1.22 / is quasiconformal if Df < +oo, i.e. Kf < 1. 

Clearly if / is conformal then Df = 1, Kf = 0. 

We want now to extend the notion of quasiconformal map to homeomor- 
phisms. We will follow the geometric approach outlined in [Ah2]. 

Exercise A 1.23 Given two rectangles R± and R2 respectively with sides a\ < b\ 
and 02 < &2 5 show that there exists a conformal map of R\ onto R2 which maps 
vertices on vertices if and only if ^- = ^ . 

Definition A1.24 A quadrilateral Q(z±, 22, 23, Z4) is a Jordan domain in C with 
four distinguished boundary points z±, Z2, Z3, Z4. Its modulus M(Q) is the ratio 
a/b of the lengths a < b of the sides of any rectangle R which is the conformal 
image of Q and whose vertices are image of the distinguished points. 
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Note that the modulus of a quadrilateral is a conformal invariant. Thus one can 
use its variation under a homeomorphism to measure the lack of conformality of 
a map. 

Definition A1.25 Let f : C — ► C be an orientation-preserving homeomorphism. 

Its maximal dilatation is D f := ^V Q ^ c ,Qquadr Hater al^M^) 1 - Let K f = 
(Df — l)/(Df + 1). f is a quasiconformal homeomorphism if Df, +00, i.e. Kf < 1. 

Exercise A 1.26 Prove that / is conformal if and only if Df = 1. 

Theorem A1.27 A quasiconformal homeomorphism f with maximal dilatation 
Df is almost everywhere differentiable and at each point zq where f is differentiable 
one has 



\fz(z )\ 



< 



K f 



Perhaps the most useful result in the theory of quasiconformal maps is the following 
existence theorem also known as Measurable Riemann Mapping Theorem [Ah2, 
p.98] 

Theorem A1.28 Let n be a complex-valued measurable function with ||//||oo < 1- 
There exists a quasiconformal mapping f such that 

fz = H-{z)f z almost everywhere (AT. 11) 

and f leaves the points 0, 1, 00 fixed. 

The equation (Al.ll) is also known as Beltrami equation. 

Quasiconformal maps have been introduced in the subject of holomorphic 
dynamics by Dennis Sullivan and Adrien Douady and have rapidly become a 
standard tool. What we will need in Chapter 3 is the following 

Theorem A1.29 (Douady— Hubbard : stability of the quadratic polyno- 
mial) Let P\(z) = X (z — 4j- J and let F(z) = P\(z)+ip(z) where ip is holomorphic 
and bounded in the disk D 3 , ip(z) = Yl < ^=2^nZ n (i.e. ip(0) = ip'(0) = 0)- Assume 
that sup z6D , 3 |V'( 2; )| < 10 -2 . Then there exists a quasiconformal homeomorphism 
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h such that on the disk O2 one has h 1 Fh = P\. If ip is small enough then h is 
near the identity in the C° topology. 



A2. Continued Fractions 

In this appendix we recall some elementary facts on standard real continued 
fractions (we refer to [MMY], and references therein, for more general continued 
fractions) . 

We will consider the iteration of the Gauss map 



A : (0, 1) h+ [0, 1] 



042.1) 



defined by 



A(x) = — 

x 



1 

X 



A is piecewise analytic with branches 



A(x) = x 1 — n if < x < — , n > 1 

n + 1 ~ n 



(A2.2) 



Exercise A2.1 Prove that A* (p(x)dx) = p(x)dx where p(x) = [(1 + x) log 2] 1 , 
i.e. p is an invariant probability density for the Gauss map. 



Let 



G = : — , g = G = 



2 2 

To each x G K \ Q we associate a continued fraction expansion by iterating A as 

follows. Let 

Xq = x — [x] , 

a = [x] , 
then one obviously has x = ao + xq. We now define inductively for all n > 



(A2.3) 



" 1 



O n +l 






> 1 



(A2A) 
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thus 

x~ x =a n+1 +x n+1 . (A2.5) 

Therefore we have 

1 1 . A . 

x = ao + xq = cio H = . . . = oo H i , (A2.6) 

«i H 

a 2 + ' • + 



and we will write 

x = [a ,ai,...,a n ,...] . (A2.7) 

The nth-convergent is defined by 

T) 1 

— = [a , oi, . . . , a n ] = a -\ — (A2.1 

Qn 



a i 


+ 




1 






a 2 + 




+ 


1 
a n 



Exercise A2.2 Show that the numerators p n and denominators q n are recursively 
determined by 

p_i = 9-2 = 1 , p_ 2 =q-!=0 , (A2.9) 

and for all n > one has 

Pn = a n Pn-l +Pn-2 , 
Qn = O-nQ.n-1 + 9n-2 • 

Exercise A2.3 Show that for all n > one has 



(A2.10) 



x n = QnX - Pn , (.42.12) 

Qn-lX-Pn-l 

q n p n _ l -p n q n _ 1 = (-l) n . (.42.13) 

Note that g n +i > q n > and that the sequence of the numerators p n has the same 
constant sign of x. Equation (A2.13) implies also that for all k > and for all 

x G R \ Q one has 2^ < x < ^2h±l_ 
Let 

/?„ = n? =0 Xi = {-l) n {q n x - pn) for n > 0, and /5_i = 1 . (.42.14) 

Then x n = (3 n f3~\ and (3 n - 2 = a n fin-\ + Pn- 
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Proposition A2.4 For all x G R \ Q and for all n> 1 one has 

1 1 

$ |</ n x - p n | = Qn+1 + QnXn+1 > so that 2 < n g n+1 < 1 ; 

fii; /3n < 9 n and q n > \c 



yn—l 



Proof. Using (A2.ll) one has 

Pn+1 + Pn%n+1 WnPn+l - PnQn+1 \ 

i . Pn 

Qn+1 + QnXn+1 

1 



I QnX Pi 



Qn+1 + QnXn+1 



Qn+1 + QnXn+1 



by (A2.13). This proves (i). 

Let us now consider j3 n = xoX\ . . .x n . If x k > g for some k G {0, 1, . . . ,n — 1}, 
then, letting m = x^ 1 — Xk+i > 1, 

x k x k+1 = 1 - mx k <l-x k <l-g = g 2 . 

This proves (ii). □ 

Remark A2.5 Note that from (ii) it follows that XlfcLo ° S9fc an d Sfclo ~ are 
always convergent and their sum is uniformly bounded. 

For all integers k > 1, the iteration of the Gauss map k times leads to the following 
partition of (0, 1) ; U ai ,...,a k I(ai, . . . , a k ), where Oj G N, i = 1, . . . , k, and 

( (Pk 2h+2h=L ) if A; is even 
Tl \ J V 9fc <ik+qk~i ) 

liau .... a k ) = < ) ( 

I (ss±es=I,2^ if fc is odd 

I, V Qk+qk-i ' gfc / 

is the branch of A fc determined by the fact that all points x G I(ai, . . . , a k ) have 
the first /c + 1 partial quotients exactly equal to {0, oi, . . . , a k }. Thus 

Tf \ I r/ml Pk+Pk-lV c / nr 

i(ai, ...,at = UG 0,1 x = ■ , y G (0, 1, 

I qk + qk-xy 

Note that -f- = -, — ^ — - — p- is positive (negative) if k is even (odd). It is immediate 
to check that any rational number p/q G (0,1), (p, q) = 1, is the endpoint of 
exactly two branches of the iterated Gauss map. Indeed p/q can be written as 
p/q = [a\, . . . , ajfc] with k > 1 and a k > 2 in a unique way and it is the left (right) 
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endpoint of I(ai, . . . , au) and the right (left) endpoint of I(ai, . . . , a,k — 1, 1) if k 
is even (odd). 



A3. Distributions, Hyper functions, Formal Series. Hypoellipticity 
and Diophantine Conditions. 

A3.1 We follow here [HI], Chapter 9 but we also recommend [Ph], especially the 
first few chapters, for a nice introduction to hyperfunctions and their applications. 
Let K be a non empty compact subset of R. A hyperfunction with support in K 
is a linear functional u on the space O(K) of functions analytic in a neighborhood 
of K such that for all neighborhood V of K there is a constant Cy > such that 

\u(<p)\ < C v svp\<p\ , Vv?eO(y). 
v 

We denote by A'(K) the space of hyperfunctions with support in K. It is a Frechet 
space : a seminorm is associated to each neighborhood V of K. 

Let 1 (C \ K) denote the complex vector space of functions holomorphic on 
(C \ K) and vanishing at infinity. One has the following 

Proposition A3.1 The spaces A'(K) and 1 (C\K) are canonically isomorphic. 
To each u G A'(K) corresponds <p G (9 1 (C \ K) given by 

<p(z) = u(c z ) ,VzeC\K, 

where c z (x) = \^ri- Conversely to each <p G (9 1 (C \ K) corresponds the 
hyperfunction 

u(ip) = — / <p(z)ip(z)dz , VV> G A 

27T J 1 

where 7 is any piecewise C 1 path winding around K in the positive direction. We 
will also use the notation 



u(x) = —[<p(x + iO) — <p(x — iO)] 

Zi I. 



for short. 



Proof. It is very easy : note that the function x 1— > c z (x) is analytic in a 
neighborhood of K for all z ^ K. Then it is immediate to check applying Cauchy's 
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formula that these two correspondences are surjective and are the inverse of one 
another. □ 



A3. 2 Let T 1 = R/Z C C/Z. A hyperfunction on T is a linear funtional U on the 
space 0(T l ) of functions analytic in a complex neighborhood of T 1 such that for 
all neighborhood V of T there exists Cy > such that 



\U($)\<C v svp\(p\, V$G0(y). 
v 

We will denote ^'(T 1 ). the Frechet space of hyperfunctions with support in T. 
For U G A'(T), let U(n) := U(e_ n ) with e n (z) = e 2ntnz . 

Exercise A3. 2 Show that the doubly infinite sequence {U(n)) ne i satisfies 

\U(n)\ <C e e 27r|n|e . 

for all e > and for all n G Z with a suitably chosen C e > 0. Conversely show 
that any such sequence is the Fourier expansion of a unique hyperfunction with 
support in T. 

Let 0-£ denote the complex vector space of holomorphic functions $ : C \ R — ► C, 
1-periodic, bounded at ±ioo and such that $(±zoo) := limQ mz ^± 00 <&(z) exist 
and verify $(+zoo) = — <&(— zoo). 

Exercise A3. 3 Show that the spaces ^'(T 1 ) and Oy, are canonically isomorphic. 
Indeed to each U G ^'(T 1 ) corresponds $ G Oy, given by 

$( z ) = t/(C 2 ) , Vz G C \ K , 

where C z {x) = cotg7r(x — z). Conversely to each $ G Oj; corresponds the 
hyperfunction 

U(V) = - f $(z)^(z)dz , V^ G ^(T 1 ) 
2 Jt 

where Y is any piecewise C 1 path winding around a closed interval I C R of length 
1 in the positive direction. We will also use the notation 

U(x) = —[&(x + iO) - $(x - i0)] 
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for short. 

The nice fact is that the following diagram commutes : 



A' ([0,1]) 



£, 



/ / r rrl'> 



A'(T 



O 1 (C\[0,l]) 



E, 



a 



the horizontal lines are the above mentioned isomorphisms, ^ z is the sum over 
integer translates : (Ez^X^) = EnezVX 2 - n). 

A3. 3 As we have seen in A3. 2 periodic distributions and hyperfunctions are 
naturally identified with the two following subspaces of the complex vector space 
of formal Fourier series 

+oo 

V? G V'(T) <=> <p(6) = Y^ fi(n)e 27Ttn9 and there existsM > , r > 

— OO 

such that \<p(n)\ < M\n\ +r Vn G Z* , 

+oo 

V? G A'(T) <£> ip(6) = J2^(n)e 27Tin9 and for alle > there existsC e > 

— oo 

such that |v?(n)| < C e exp(27r|n|£) Vn G Z . 
Let us now consider the following linear first-order difference equation on T 1 



f g (9 + a)-f 9 (9) = g(e) 

where a 6 1 \ Q. A necessary condition for the existence of a solution is that 
J g(0)d0 = 0. Thus we introduce the zero-mean Dirac delta function on T 1 

*r,o(0)= E e27Tin9 
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and we note that the corresponding f$ plays the role of a fundamental solution 
since 

f 9 = fs&9= J2 fs(n)g(n)e 2 ™ e = — J f 6 {6 - 6 1 )g(6 1 )d6 1 

n= — 00 ^ 

when the integral makes sense. 

On the other hand one clearly has 



fs(0) = J2 



p 1ixiriB 



plixiria 1 

as a formal power series. We have the following elementary Proposition which can 
also be taken as an equivalent definition of diophantine numbers 

Proposition A3. 4 fs is a distribution if and only if a G CD. fs is a 
hyperfunction if and only if the denominators q n of the convergents of a verify 

llm n^+oo — U. 

The proof is immediate and it is left as an Exercise. 

Note that the above discussion carries over easily to the linear PDE on the 
two-dimensional torus T 2 

(d dl +adg 2 )f = g 

(which is associated to the linear flow 9± = 1, 62 = a). In this case one 
has Sj2 j0 = Ylmei 2 , Q e 2 ' n " l ( riiei+ri292 ^ and the fundamental solution is fs = 

Snez 2 n^o " ' 2-Ki(n +n a) • Then Proposition A3. 4 holds also in this case, showing 
that the operator dg 1 + adg 2 is hypoelliptic if and only if a is diophantine. 
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List of symbols 

Ad : adjoint action 

c(Q, Zq) : conformal capacity of Q w.r.t. Zq 

C : complex plane 

C* : C\{0} 

C : Riemann sphere 

C{z} : ring of convergent power series in one complex variable 

C[[z]] : ring of formal power series in one complex variable 

Cent : centralizer 

Cent : formal centralizer 

Df(zo), Df : dilatiation of f (maximal) 

D : open unit disk {\z\ < 1} 

D r : open disk {\z\ < r} of radius r > 0. 

E : outer disk {\z\ > 1} 

[/] : orbit of / 

F(R) : Fatou set of R 

G : group of germs of holomorphic diffeomorphisms of (C, 0) 

G : formal analogue of G 

G\ : elements of G with linear part A 

G\ : formal analogue of G\ 

J(R) : Julia set of R 

N : non-negative integers 

O : a region of C 

Q : rational integers 

R\ : the germ R\(z) = Xz 

S : univalent maps on D 

S\ : elements of S with linear part R\ 

Sgi : elements of S with linear part of unit modulus 

S 1 : unit circle {\z\ = 1} 

u Yoccoz's function, see Section 3.1 

y : see Chapter 4 

Z : integers 
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